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IMPROVED TRANSFORMATION EFFICIENCY IN PHAGE DISPLAY THROUGH 
MODIFICATION OF A COAT PROTEIN 



FIELD OF THE INVENTION 
5 The invention relates to fusion proteins of a polypeptide and a coat protein of a virus, 

where a coat protein is not a wild type coat protein. The invention also relates to replicable 
expression vectors which contain a gene encoding the fusion protein, host cells containing the 
expression vectors, a virus which displays the fusion protein on the surface of the virus, libraries of 
the virus displaying a plurality of different fusion proteins on viral surfaces and methods of using 

10 these compositions. 

The invention also relates to a method of transforming cells by electroporating cells to 
improve transformation efficiency. In various preferred embodiments, the transformation is 
performed in the presence of a high concentration of DNA; in the presence of a high concentration 
of cells; with highly purified DNA; with specific host cells; or with combinations of these. When 

1 5 used to prepare libraries, for example phage display libraries, these improvements allow for the 
construction of larger libraries in a single electroporation step than has been previously possible. . 
The invention is alsti directed to a rriethod of producing fa product polypeptide ; By transforming host 
cells using the method of the invention. 



20 DISCUSSION OF THE BACKGROUND 

Bacteriophage (phage) display is a technique by which variant polypeptides are displayed 
as fusion proteins to the coat protein on the surface of bacteriophage particles (Scott, J.K. and 
Smith, G. P. (1990) Science 249: 386). The utility of phage display lies in the fact that large 
libraries of selectively randomized protein variants (or randomly cloned cDNAs) can be rapidly 

25 and efficiently sorted for those sequences that bind to-a target moleciile with high affinity. Display 
of peptide (Cwirla, S. E. et ali (1990) Proc. Natl. Acad, Sci. USA, 87:6378) or protein (Lowiiian, 
H.B. et al. (1991) Biochemistry, 30:10832; Clackson, T et al. (1991) Nature, 352: 624; Markis, J. 
D. et al. (1991), J. MoL Biol, 222:581; Kang, A.S. et al; (1991) Proc. Natl Acad: Sci: USA; - 
88:8363) libraries on phage have been used for screening millions of polypeptides for ones with 

30 specific binding properties (Smith, Gi P. (1991) Current Opin. BiotechnoL, 2:668)! Sorting phage 
libraries of random mutants requires a strategy for constructing and propagating a large number of 
variants, a procedure for affinity purification using the target receptor, and a means of evaluating 
the results of binding enrichments. U:S. 5,223,409; U.S. 5,403;484; U.S. 5,571,689; U.S: 
5,663,143. ■ . 

35 Typically, variant polypeptides are fused to a gene III protein, which is: displayed at one 

end of the virbn. Alternatively, the variant polypeptides may be fused to the gene VIII protein, 
which is the major coat protein of the viron. Such polyvalent display libraries are constructed by 
replacing the phage gene III with a cDNA encoding the foreign sequence fused to the amino 
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the ligand will bind to a target molecule and a second solution in which the affinity ligand will not 
bind to the target molecule, to selectively isolate binding ligands. WO 97/4625 1 describes a 
method of biopanning a random phage display library with an affinity purified antibody and then 
isolating binding phage, followed by a micropanning process using microplate wells to isolate high 
5 affinity binding phage. The use of Staphylococcus aureus protein A as an affinity tag has also 
been reported (Li et al. (1998) Mol Biotech., 9:187). WO 97/47314 describes the use of substrate 
subtraction libraries to distinguish enzyme specificities using a combinatorial library which may be 
a phage display library. A method for selecting enzymes suitable for use in detergents using phage 
display is described in WO 97/09446. Additional methods of selecting specific binding proteins 

10 are described in U.S. 5,498,538; U.S. 5,432,018; and WO 98/15833. 

Methods of generating peptide libraries and screening these libraries are also disclosed in 
U.S. 5,723,286; U.S. 5,432,018; U.S. 5,580,717; U.S. 5,427,908; and U.S. 5,498,530. See also 
U.S. 5,770,434; U.S. 5,734,018; U.S. 5,698,426; U.S.5,763,192; and U.S. 5,723,323. 

Methods which alter the infectivity of phage are also known. WO 95/34648 and U.S. 

15 5,516,637 describe a method of displaying a target protein as a fusion protein with a pilin protein 
of a host cell, where the pilin protein is preferably a receptor for a display phage. U.S. 5,712,089 
describes infecting a bacteria with a phagemid expressing a ligand and then superinfecting the 
bacteria with helper phage containing wild type protein in but not a gene encoding protein in 
followed by addition of a protein Ill-second ligand where the second ligand binds to the first ligand 

20 displayed on the phage produced. See also WO 96/22393. A selectively infective phage system 
using non-infectious phage and an infectivity mediating complex is also known (U.S. 5,514,548). 

Phage systems displaying a ligand have also been used to detect the presence of a 
polypeptide binding to the ligand in a sample (WO/9744491), and in an animal (U.S. 5,622,699). 
Methods of gene therapy (WO 98/05344) and drug delivery (WO 97/12048) have also been 

25 proposed using phage which selectively bind to the surface of a mammalian cell. 

Further improvements have enabled the phage display system to express antibodies and 
antibody fragments on a bacteriophage surface, allowing for selection of specific properties, i.e., 
binding with specific ligands (EP 844306; U.S. 5,702,892; U.S. 5,658,727) and recombination of 
antibody polypeptide chains (WO 97/09436). A method to generate antibodies recognizing 

30 specific peptide - MHC complexes has also been developed (WO 97/02342). See also U.S. 
5,723,287; U.S. 5,565,332; and U.S. 5,733,743. 

U.S. 5,534,257 describes an expression system in which foreign epitopes up to about 30 
residues are incorporated into a capsid protein of a MS-2 phage. This phage is able to express the 
chimeric protein in a suitable bacterial host to yield empty phage particles free of phage RNA and 

35 other nucleic acid contaminants. The empty phage are useful as vaccines. 

The degree of expression of polypeptides as fusion proteins on the surface of 
bacteriophage particles is variable and depends, to some extent, on the size of the polypeptide. 
Conventional phage display systems use wild type phage coat proteins and fuse the heterologeous 



PCT/US99/16596 

WO 00/06717 

polypeptide to the amino terminus of the wild type amino acid sequence or an amino terminus 
resultmgfromtruncationofmewildtypecoatproteinsequence. Segments of linker ammo acids 
have also been added to the ammo terminus of the wild type coat protein sequence to unprove 

selection and target binding. 
5 Notwithstanding numerous modifications and improvements in phage technology, a need 

continues to exist for improved methods of displaying polypeptides as fusion proteins in phage 

display methods. . 

Methods of transforming cells to introduce new DNA are of great practical mterest » 
molecular biology and modern genetic engineering. Early methods involved chemical treatment of 

,0 bacteria with solutions of metal ions, generally calcium chloride, followed by heating to produce 
competent bacteria capable of functioning as recipient bacteria and able to take up heterologous 
DNA derived from a variety of source, These early protocols provided transformation yields of 
about 10 5 - ^transformed colonies per pgram of plasmid DNA. Subsequent improvements using 
different cations, longer treatment times and other chemical agents have allowed improvements m 

,5 tra nsformationefficiencyofuptoaboutl0 8 colonie^gramofDNA. Sambrook et al. Mo lecular 
Cloning: A Laboratory Manual, 2nd edition, (1989) Cold Spring Harbor laboratory Press, Cold 

Spring Harbor, NY, page 1 .74. 

Cells can also be transformed using high-voltage electroporation. Electroporation » 
suitable to introduce DNA into eukaryotic cells (e.g. animal cells, plant cells, etc.) as well as 
20 bacteria, e: g ,E.coli. Sambrook et al., Hi, pages 1.75, 16.54-16,5. Different cell types require 
different conditions for optimal electroporation and preliminary experiments are generaUy 
conducted to fmd acceptable levels of expression or transformation. For mammal . .cells 
voltages of 250-750 V/cm result in 20-50% cell survival. An electric pnlse length of 20-100 ms at 
a temperature ranging from room temperature to 0»C and below using a DNA concentration of 1- 
25 40 pgram/mL are typical parameters. Transfection efficiency is reported to be higher usmg hnear 
DNA and when the cells are suspended in buffered salt solutions than when suspended in neuronic 
solutions. Sambrook etal., above, pages 16.54-16.55. ...... 

Dower et al., 1988, Nucleic Acids Research, 16:6127-6145 extensively studied high- 
efficiency transformation of E. coli by high-voltage electroporation. This study evaluated 
30 numerous parameters, including electrical variables such as the effect of field strength and puke 
length the effect of DNA concentration and cell concentration on the recovery of transform^ 
accuracy, reproducibility, etc. and provided a protocol for high-efficiency eleetrotransfonnaUon of 
E. coli cells. The optimized protocol of Dower et al. uses cells concentrated in a range of at least 
and up to 4 x l0 ,0 /mL, a DNA concentration of from about 1 to 10 pgrams/mL, 12.5-16.7 kV/cm, 
35 3-25 PF and the eleetroporation is conducted at 0°C (ice temperature). These studies were 
conducted with highiy purified closed circular plasmid DNA, which is known to give high 
transformation efficiencies. Dower et al. report transformation efficiencies of 10 - 10 

4 
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tran$formants/(Jgnun of DNA achieved by highly optimizing these parameters. For library 
formation, Dower et al. suggests using a DNA concentration of less than 10 nanogram/mL and a 
cell concentration of greater than 3 x 10 10 to minimize co-transformants. See also U.S. 4,910,140 
and U.S. 5,186,800 to Dower et al. and U.S. 4,849,355 to Wong. 
5 Several attempts have been made to improve the design of electroporation apparatus (see, 

for example, U.S. 5,173,158; U.S. 5,098,843; U.S. 5,422,272; U.S. 5,232,856; and U.S. 5,283,194) 
and to improve electroporation of specific cells (see U.S. 5,128,257). U.S. 5,124,259 describes an 
improved buffer for electroporation. U.S. 4,956,288 describes a method for producing cells 
containing foreign DNA in high copy numbers. 

10 The attainment of higher transformation efficiencies by optimizing the electroporation 

parameters has been difficult. The use of higher voltages and longer pulses results in an increase in 
cell death, decreasing the total number of transformed cells. Highly optimized electroporation still 
results in about 50-75% cell death. Dower et al. represents an important investigation of the 
parameters of electroporation and the protocol described in this paper has formed the basis of more 

1 5 recent electroporation procedures 

An important emerging use of cell transformations, including electroporation, is the 
preparation of peptide and protein variant libraries. In these applications, a replicable transcription 
vector, for example a plasmid, is reacted with a restriction enzyme to open the plasmid DNA, 
desired coding DNA is ligated into the plasmid to form a library of vectors each encoding a 

20 different variant, and cells are transformed with the library of transformation vectors in order to 
prepare a library of polypeptide variants differing in amino acid sequence at one or more residues. 
The library of peptides can then be selectively panned for peptides which have or do not have 
particular properties. A common property is the ability of the variant peptides to bind to a cell 
surface receptor, an antibody, a ligand or other binding partner, which may be bound to a solid 

25 support. Variants may also be selected for their ability to catalyze specific reactions, to inhibit 
reactions, to inhibit enzymes, etc. 

In one application, bacteriophage (phage), such as filamentous phage, are used to create 
phage display libraries by transforming host cells with phage vector DNA encoding a library of 
peptide variants. J.K. Scott and G.P. Smith, Science, (1990), 249:386-390. Phagemid vectors may 

30 also be used for phage display. Lowman and Wells, 1991, Methods: A Companion to Methods in 
Enzymology, 3:205-216. The preparation of phage and phagemid display libraries of peptides and 
proteins, e.g. antibodies, is now well known in the art. These methods generally require 
transforming cells with phage or phagemid vector DNA to propagate the libraries as phage 
particles having one or more copies of the variant peptides or proteins displayed on the surface of 

35 the phage particles. See, for example, Barbas et al., Proc. Natl Acad. Sci. t USA, (1991), 88:7978- 
7982; Marks et al., J, Mol Biol, (1991), 222:581-597; Hoogenboom and Winter, 7. Mol Biol, 
(1992), 227:381-388; Barbas et al., Proc. Natl Acad ScL t USA, (1992), 89:4457-4461; Griffiths et 
al., EMBO Journal, (1994), 13:3245-3260; de Kruif et al., J. Mol Biol, (1995), 248:97-105; 
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containing a gene fusion which encodes this fusion protein. A further object is to provide host 
cells containing the replicable expression vector. 

Another object of the invention is to provide a library of the replicable expression vectors 
where the vectors contain a plurality of different gene fusions encoding a plurality of variant fusion 
5 proteins. A further object is to provide a library of virus particles which display a plurality of 

variant fusion proteins where the fusion proteins contain a coat protein variant which does not have 
the wild type coat protein sequence, and the host ceils containing the vector libraries. 

An additional object of the invention is to provide a method of constructing the libraries of 
expression vectors and virus particles. 
10 A further object is to provide a method of modulating the number of fusion proteins which 

are displayed on the surface of a phage or phagemid particle through the use of the fusion protein 
of the invention. 

Another object of the present invention is to provide an improved method of transforming 
cells by electroporating competent cells in the presence of heterologous DNA. 
15 A further object of the invention is to provide an improved strain of E. coli ceils which 

have improved characteristics and allow higher transformation yields with electroporation. 

A further object is to provide a method of producing a product polypeptide by culturing a 
host cell transformed with a replicable expression vector where the host cells have been 
transformed using the method of the present invention, and product polypeptides produced by this 
20 process. 

One embodiment of the invention is a method, which includes constructing a library 
containing a plurality of replicable expression vectors, each expression vector containing a 
transcription regulatory element operably linked to a gene fusion encoding a fusion protein, where 
the gene fusion contains a first gene encoding a first polypeptide and a second gene encoding a 

25 phage major coat protein, where the library contains a plurality of second genes encoding variant 
phage major coat proteins. The method may further include transforming suitable host cells with 
the library of vectors and culturing the transformed cells under conditions suitable to form the 
fusion proteins. Preferably, the vector is phage or phagemid DNA and the culturing is sufficient to 
form phage or phagemid particles which display fusion proteins on the surfaces thereof. The 

30 method may also include contacting the phage or phagemid particles with a target molecule so that 
at least a portion of the particles bind to the target molecule , and separating the particles that bind 
from those that do not bind. The method may further include selecting a bound particle, 
constructing a second library containing a plurality of replicable expression vectors, each 
expression vector containing a transcription regulatory element operably linked to a second gene 

35 fusion encoding a second fusion protein, where the second gene fusion contains a third gene 
encoding a second polypeptide and a fourth gene encoding the major coat protein variant of the 
fusion protein displayed on the surface of the selected bound particle, where the library contains a 
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polypeptide may be the same or different. 

Inanomeremboclimenuhemveationisamemodmdudingthesteps: 

(a) constructing library conmimng a plurality off.strepheable express^ 

vectors^chexpressionvectorcomprismgatranscnpUonre^ 

operablylinkedto a first gene fusion encoding a first fusion protein, wherem the 

first gene fusion comprises 

a first gene encoding a first polypeptide and 
a second gene encoding a phage major coat protein, and 
wherein the first library contains a plurality of first vectors encoding second genes 



encoding variant phage major coat proteins; 

(b) transfonning suitable host cells with the first library of vectors and cultunng 
thetransformedcellsunder conditions suitable to form phage or phagemid parties; 

(c) contacting thephage or phagenud particles withatarget molecule so that at 
1 5 least a portion of the particles bind to the target molecule; 

(d) separating particles that bind from those that do not bind; 

(e) selecting a particle; 

(Oconstructingasecondreplicable expression vector comprising a transcnpUon 
regulatory element operably linked to a second gene fusion encoding a seeond 
20 fusion protein, wherein the second gene fusion comprises 

a third gene encoding a second polypeptide and 

a fourth gene encoding the major coat protein variant of the fusion protem 

displayed on the surface of the selected bound particle; 
(^tmnsformmgsuitablehostc.llswiththesecondvector.cultunngthe 

the second fusion protein on the surface thereof; and 

(h) separating particles that bind from those that do not bmd. 

^methodmayalsoincludeconstractingasecondUbrary of 

^^encodingvar^ntsecondpoivpeptides. T,e method may further mclude steps. 

0 (i) selecting a particle; 

0) constmctmgathirdexpressionvectorcomprisinga transcnpUon regulatory 

0c) tnmsformmgsmtablehost^^^ 
culturingthetransformedceUsunder conditions — e for " 8 ^ ^ P " U owing 
Th-e and other objects which will become apparent in the course of the foUowmg 

35 description 

cells by electroporatmg consent cells m the presence of heterologous DNA, whe e *e DNA 
purified by affinity purification, and is preferably present at a concentrate of about 
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picogram/mL to about 500 pgram/mL. The DNA is generally present at a concentration of a few 
to several hundred nanograms/mL or greater, preferably about 1 to about 50 pgrams/mL or greater, 
even more preferably about 70 Mgrams/mL or greater, and may be present at a concentration of 
greater than 100 pgrams/mL to about 500 pgrams/mL. 
5 In part, the present invention is also based on the discovery that prior art methods of 

preparing DNA for electroporation, for example the preparation of clonable recombinant DNA, 
using phenol extractions and ethanol precipitation, have generally resulted in DNA solutions 
having unacceptably high conductance. Electroporation instruments are generally configured to 
have a sample cell in parallel with a capacitor and a resistor (R2) to control the electric pulse 

10 duration through the sample. Ideally, the resistance of the sample (Rl) should be much greater 
than that of R2 so that the electric pulse decays mainly through R2. In a preferred electroporation, 
where essentially the entire discharge occurs through R2, the time constant would approach the 
theoretical maximum where Rl is infinite. DNA is an ionic molecule, and thus DNA 
electroporation samples have an inherent conductance. Furthermore, DNA preparations containing 

15 electrically charged impurities such as proteins, salts, buffers, etc., introduce additional 
conductance. The volume of a DNA preparation (and thus the mass of DNA) which can be 
introduced into an electroporation reaction is limited by the conductance of the preparation. As the 
conductance of the sample increases, Rl decreases and becomes significant in comparison to R2, 
i.e. a significant proportion of the electric pulse is discharged through Rl. This results in a 

20 decrease in the time constant and a decrease in the transformation efficiency. Further increases in 
sample conductance result in* electric arcing across the electrodes and a failure of the 
electroporation. The high conductance of DNA solutions prepared using prior art methods 
practically limits electroporation reactions to low DNA concentrations, since higher concentrations 
results in electrical arcing. The invention solves this problem, in part, by providing a method of 

25 electroporating cells with affinity purified DNA and/or at DNA concentrations much greater than 
was thought possible. It has been discovered that the DNA in prior art DNA preparations 
contributes only a small proportion of the total conductance; the majority of the conductance in 
these preparations is due to ionic impurities. The present invention uses affinity DNA purification 
to reduce ionic impurities and thus reduce the conductance associated with a unit mass of DNA. 

30 Although the prior art generally suggests using purified DNA for electroporation and several 
standard purifications have been used, for example, DNA precipitation and membrane filtration, 
the use of affinity purification has not been utilized and the very high DNA concentrations which 
can be used in the method of the invention and the resulting high transformation yields are 
surprising. 

35 The invention provides an improved method of transforming cells by highly purifying 

DNA, for example recombinant clonable DNA, preferably closed circular DNA, more preferably 
phage or phagemid vector DNA. The invention enables one to prepare DNA solutions of high 
concentration, preferably an aqueous solution having very low conductance, for example a non- 
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Figures 4 A and 4B: Phage ELISAs for SAV display. Figure 4A shows results using anti- 
SAV polyclonal antibody as target. Figure 4B shows results using Biotin-BSA conjugate as 
target. Display was measured for SAV-protein VIII (circles), SAV-protein Vm (2e) (triangles), 
SAV-protein VIII (2f) (diamonds), and SAV- protein VIII (2a) (squares). Phage were produced 
5 from uninduced cultures. 

Figure 5: Phage ELISAs for hGH display with protein VIII variants combining mutations 
in different zones. An anti-hGH monoclonal antibody was used as target. Display was measured 
for hGH displayed as a fusion with protein VIII (filled circles), protein Vm(la) (filled squares), 
protein VIII(2a) (filled diamonds), protein Vm(3a) (filled triangles), protein Vm containing the 
10 mutations of protein VIII(la) and protein VHI(2a) (unfilled circles), or protein Vm containing the 
mutations of protein VIII(2a) and protein VHI(3a) (unfilled squares). 

Figure 6: Phage ELISAs for hGH display with protein VIII variants derived from protein 
VHI(2a). An anti-hGH monoclonal antibody was used as target. Display was measured for hGH 
displayed as a fusion with either protein VTII(2a) (filled circles) or with protein VIII(2a) containing 
1 5 the following mutation: E 1 2N (unfilled squares), D 1 6 A (unfilled circles), or 1 1 7S (unfilled 
triangles). 

Figures 7 A and 7B: Linkers selected for hGH display or SAV display. 7 A) Linkers 
selected for display of hGH. Linkers were of the form (Gly)3(Xaa) l4(Gly)2, where (Xaa) 14 is the 
selected sequence shown. 7B) Linkers selected for display of SAV. 

20 Figures 8 A and 8B: Phage ELISAs for protein display with selected linkers. 8A) hGH 

displayed on protein VIII using either a Gly/Ser linker (phagemid pS349, circles) or the linker 
selectant Linkl (Fig. 7A, squares). Phage were produced from cultures which were either 
uninduced (unfilled) or induced with 10 uM IPTG (filled). The hGHbp was used as target. 8B) 
SAV displayed on wild type protein VIII using a Gly/Ser linker (unfilled circles) or on variant 

25 protein VIII(2e) using either a Gly/Ser linker (unfilled squares), linkl 8 (filled circles), link29 
(filled squares), link34 (filled diamonds), or link37 (filled triangles). The Gly/Ser sequence was 
identical to the Gly/Ser linker encoded by pS349. The sequences of the other linkers are shown in 
Figure 7B. Biotinylated BSA was used as target. 

Figure 9: Phage ELISAs for hGH display with protein VIII and selected protein VIII 

30 variants. The hGHbp was used as target (Kd = 1.6 nM, Pearce, K.H. Jr. et aL, (1997) J. Biol. 

Chem. 272:20595-20602). Display was measured for hGH-protein VIII expressed from phagemid 
pS1607 (see Example 1 1) (circles), hGH-protein VHl(la) (squares), hGH-protein VIII(2a) 
(diamonds), and hGH-protein VIII(3a) (triangles). Phage were produced from cultures which 
were either uninduced (unfilled) or induced with 10 uM IPTG (filled). The sequences for the 

35 protein VIII variants are show in Figure 1 . 

Figure 10: Phage ELISAs for Fab display with protein VIII and a protein VIII mutant. A 
monoclonal antibody specific for a peptide flag fused to the N-terminus of the Fab heavy chain was 
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control, phage binding to a BSA-blocked plate was also measured (filled bars). The phage were 
used at a concentration of 10 13 phage/mL. See Example 24. 

Figure 1 7: Phage ELIS As for hGH or hGHsm display with P 1 2-7. Display was measured 
forhGHsm fused to the C-terminus of PI 2-7 (phagemid pS1258, open circles), hGH fused to the 
5 C-terminus of P12-7 (phagemid pW930a, open squares), hGHsm fused to the C-terminus of P12-1 
(phagemid pS1239b, filled circles), hGH fused to the C-terminus of P12-1 (phagemid pS1239a, 
filled squares), or hGH fused to the N-terminus of protein VD1 (phagemid pS1607, filled 
diamonds). An anti-hGH monoclonal antibody was used as target. See Example 25. 

Figure 1 8: Phage ELIS As for the display of a peptide fused to the C-terminus of the 
10 protein III C-terminal domain using selected linkers. A hexaHis flag was displayed with 

intervening linker sequences as follows: linker-g3-l (open circles), linker-g3-2 (open squares), 
linker-g3-3, (open diamonds). Display was also measured for polyHis flags displayed as either N- 
terminal fusions with protein VIII (filled circles) or as C-terminal fusions with protein VHT using 
optimized linker- 1 (filled diamonds). Phage derived from a phagemid not encoding a polyHis flag 
1 5 were also included as a negative control (filled squares). See Example 26. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
DEFINITIONS 

The term "affinity purification" means the purification of a molecule based on a specific 

20 attraction or binding of the molecule to a chemical or binding partner to form a combination or 
complex which allows the molecule to be separated from impurities while remaining bound or 
attracted to the partner moiety. 

The term "antibody" is used in the broadest sense and specifically covers single 
monoclonal antibodies (including agonist and antagonist antibodies), antibody compositions with 

25 polyepitopic specificity, affinity matured antibodies, humanized antibodies, chimeric antibodies, as 
well as antibody fragments (e.g., Fab, F(ab')2* scFv and Fv), so long as they exhibit the desired 
biological activity. An affinity matured antibody will typically have its binding affinity increased 
above that of the isolated or natural antibody or fragment thereof by from 2 to 500 fold. Preferred 
affinity matured antibodies will have nanomolar or even picomolar affinities to the receptor 

30 antigen. Affinity matured antibodies are produced by procedures known in the art. Marks, J. D. et 
al Bio/Technology 10:779-783 (1992) describes affinity maturation by VH and VL domain 
shuffling. Random mutagenesis of CDR and/or framework residues is described by: Barbas, C. F. 
et al Proc Nat. Acad. Sci, USA 91:3809-3813 (1994), Schier, R. et al Gene 169:147-155 (1995), 
Yelton, D. E. et al J. Immunol 155:1994-2004 (1995), Jackson, J.R. et al, J. Immunol 

35 154(7):3310-9 (1995), and Hawkins, R.E. et al J. Mol Biol 226:889-896 (1992). Humanized 
antibodies are known. Jones et al, Nature, 321:522-525 (1986); Reichmann et al, Nature, 
332:323-329 (1988); and Presta, Curr. Op. Struct. Biol, 2:593-596 (1992)). 
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progeny that have the same function or biological activity as screened for in the originally 
transformed cell are included. Where distinct designations are intended, it will be clear from the 
context. 

The terms "competent cells" and "electoporation competent cells" mean cells which are in 
5 a state of competence and able to take up DNAs from a variety of sources. The state may be 
transient or permanent. Electroporation competent cells are able to take up DNA during 
electroporation. 

"Control sequences" when referring to expression means DNA sequences necessary for the 
expression of an operably linked coding sequence in a particular host organism. The control 

10 sequences that are suitable for prokaryotes, for example, include a promoter, optionally an operator 
sequence, a ribosome binding site, and possibly, other as yet poorly understood sequences. 
Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers. 

The term "coat protein" means a protein, at least a portion of which is present on the 
surface of the virus particle. From a functional perspective, a coat protein is any protein which 

15 associates with a virus particle during the viral assembly process in a host cell, and remains 

associated with the assembled virus until it infects another host cell. The coat protein may be the 
major coat protein or may be a minor coat protein. A "major" coat protein is a coat protein which 
is present in the viral coat at 10 copies of the protein or more. A major coat protein may be present 
in tens, hundreds or even thousands of copies per virion. 

20 The "detection limit" for a chemical entity in a particular assay is the minimum 

concentration of that entity which can be detected above the background level for that assay. For 
example, in the phage ELISA of Example 5, the "detection limit" for a particular phage displaying 
a particular protein (e.g. hGH) is the phage concentration at which the particular phage produces an 
ELISA signal above that produced by a control phage not displaying the protein. 

25 The terms "electroporation" and "electroporating" mean a process in which foreign matter 

(protein, nucleic acid, etc.) is introduced into a cell by applying a voltage to the cell under 
conditions sufficient to allow uptake of the foreign matter into the cell. The foreign matter is 
typically DNA. 



30 bacteriophage to infect the cell. The episome may contain other genes, for example selection 
genes, marker genes, etc. Common F' episomes are found in well known E. coli strains including 
CJ236, CSH18, DHSalphaF, JM101 (same as in JM103, JM105, JM107, JM109, JM110), 
KS1000, XL 1 -BLUE and 71-18. These strains and the episomes contained therein are 
commercially available (New England Biolabs) and many have been deposited in recognized 

35 depositories such as ATCC in Manassas, VA. 

A "fusion protein" is a polypeptide having two portions covalently linked together, where 
each of the portions is a polypeptide having a different property. The property may be a biological 



An "F factor" or "F' episome" is a DNA which, when present in a cell, allows 
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"Heterologous DNA" is any DNA that is introduced into a host cell. The DNA may be 
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25 to prevent self-ligation during the ligation step. 

A "mutation" is a deletion, insertion, or substitution of a nucleotide(s) relative to a 
reference nucleotide sequence, such as a wild type sequence. 

A "silent mutation" is a mutation which does not change the amino acid sequence of the 
translated polypeptide product of a given DNA sequence. 
30 A "non-silent mutation" is a mutation which changes the amino acide sequence of the 

translated polypeptide product of a given DNA sequence. 

"Operably linked" when referring to nucleic acids means that the nucleic adds are placed 
in a functional relationship with another nucleic acid sequence. For example, DNA for a 
presequence or secretory leader is operably linked to DNA for a polypeptide if it is expressed as a 

35 preproteinthatpa^^^ 
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Generally, "operably linked" means that the DNA sequences being linked are contiguous and, in 
the case of a secretory leader, contiguous and in reading phase. However, enhancers do not have to 
be contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do 
not exist, the synthetic oligonucleotide adapters or linkers are used in accord with conventional 
5 practice. 

"Phage display" is a technique by which variant polypeptides are displayed as fusion 
proteins to a coat protein on the surface of phage, e.g. filamentous phage, particles. A utility of 
phage display lies in the fact that large libraries of randomized protein variants can be rapidly and 
efficiently sorted for those sequences that bind to a target molecule with high affinity. Display of 

10 peptides and proteins libraries on phage has been used for screening millions of polypeptides for 
ones with specific binding properties. Polyvalent phage display methods have been used for 
displaying small random peptides and small proteins through fusions to either gene III or gene VIII 
of filamentous phage. Wells and Lowman, Curr. Opin. Struct. Biol., 1992, 3:355-362 and 
references cited therein. In monovalent phage display, a protein or peptide library is fused to a 

15 gene III or a portion thereof and expressed at low levels in the presence of wild type gene III 
protein so that phage particles display one copy or none of the fusion proteins. Avidity effects are 
reduced relative to polyvalent phage so that sorting is on the basis of intrinsic ligand affinity, and 
phagemid vectors are used, which simplify DNA manipulations. Lowman and Wells, Methods: A 
companion to Methods in Enzymology, 1991, 3:205-216. 

20 A "phagemid" is a plasmid vector having a bacterial origin of replication, e.g., ColEl, and 

a copy of an intergenic region of a bacteriophage. The phagemid may be based on any known 
bacteriophage, including filamentous bacteriophage and lambdoid bacteriophage. The plasmid will 
also generally contain a selectable marker for antibiotic resistance. Segments of DNA cloned into 
these vectors can be propagated as plasmids. When cells harboring these vectors are provided with 

25 all genes necessary for the production of phage particles, the mode of replication of the plasmid 
changes to rolling circle replication to generate copies of one strand of the plasmid DNA and 
package phage particles. The phagemid may form infectious or non-infectious phage particles. 
This term includes phagemids which contain a phage coat protein gene or fragment thereof linked 
to a heterologous polypeptide gene as a gene fusion such that the heterologous polypeptide is 

30 displayed on the surface of the phage particle. Sambrook et al., above, 4. 17. 

The term "phage vector" means a double stranded replicative form of a bacteriophage 
containing a heterologous gene and capable of replication. The phage vector has a phage origin of 
replication allowing phage replication and phage particle formation. The phage is preferably a 
filamentous bacteriophage, such as an Ml 3, fl, fd, Pf3 phage or a derivative thereof, or a lambdoid 

35 phage, such as lambda, 2 1 , phi80, phi8 1 , 82, 424, 434, etc., or a derivative thereof. 

"Preparation" of DNA from cells means isolating the plasmid DNA from a culture of the 
host cells. Commonly used methods for DNA preparation are the large- and small-scale plasmid 
preparations described in sections 1.25-1,33 of Sambrook et al, supra. After preparation of the 
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A "survivor" is a cell which remains viable after a transformation process. 

A "transcription regulatory element" will contain one or more of the following 
components: an enhancer element, a promoter, an operator sequence, a repressor gene, and a 
transcription termination sequence. These components are well known in the art. U.S. 5,667,780. 
5 A "transformant" is a cell which has taken up and maintained DNA as evidenced by the 

expression of a phenotype associated with the DNA (e.g., antibiotic resistance conferred by a 
protein encoded by the DNA). 

'Transformation" means a process whereby a cell takes up DNA and becomes a 
"transformant". The DNA uptake may be permanent or transient. 
10 "Transformation efficiency" means the number of transformants produced per unit mass of 

DNA following a transformation procedure (e.g. transformants per microgram of DNA). 

"Transformation frequency" means the ratio of the number of transformants to the number 
of survivors. 

"Transformation yield" means the number of transformants produced in a single 

1 5 eiectroporation reaction. 

A "variant" or "mutant" of a starting polypeptide, such as a fusion protein or a 
heterologous polypeptide (heterologous to a phage), is a polypeptide that 1) has an amino acid 
sequence different from that of the starting polypeptide and 2) was derived from the starting 
polypeptide through either natural or artificial (manmade) mutagenesis. Such variants include, for 

20 example, deletions from, and/or insertions into and/or substitutions of, residues within the amino 
acid sequence of the polypeptide of interest. Any combination of deletion, insertion, and 
substitution may be made to arrive at the final variant or mutant construct, provided that the final 
construct possesses the desired functional characteristics. The amino acid changes also may alter 
post-translational processes of the polypeptide, such as changing the number or position of 

25 glycosylation sites. Methods for generating amino acid sequence variants of polypeptides are 
described in U. S. 5,534,615, expressly incorporated herein by reference. 

Generally, a variant coat protein will possess at least 20% or 40% sequence identity and up 
to 70% or 85% sequence identity, more preferably up to 95% or 99.9% sequence identity, with the 
wild type coat protein. Percentage sequence identity is determined, for example, by the Fitch et al, 

30 Proc. Natl. Acad. Sci. USA 80: 1382-1386 (1983), version of the algorithm described by Needleman 
et al. y J. Mol. Biol. 48:443-453 (1970), after aligning the sequences to provide for maximum 
homology. Amino acid sequence variants of a polypeptide are prepared by introducing appropriate 
nucleotide changes into DNA encoding the polypeptide, or by peptide synthesis. An "altered 
residue" is a deletion, insertion or substitution of an amino acid residue relative to a reference 

35 amino acid sequence, such as a wild type sequence. 

A "functional" mutant or variant is one which exhibits a detectable activity or function 
which is also detectably exhibited by the wild type protein. For example, a "functional" mutant or 
variant of the major coat protein is one which is stably incorporated into the phage coat at levels 
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which can be experimentally detected. Preferably, the phage coat incorporation can be detected in 
arangeof about 1 fusion per 1000 virus particles up to about 1000 fusions per virus parbcle. 

A "hyper-functional" mutant or variant is a functional mutant or variant whose actmty 
exceeds that of the wild type. For example, a hyper-functional mutant or variant of the major coat 
5 protein is one which is stably incorporated into the phage coat at levels greater than those of the 
wild type protein in an identical context. 

A "hypo-functional" mutant or variant is a functional mutant or variant whose actmty is 

lessthan that ofthe wild type, ^f^^^^^^^^^ 9 ^ 
protein is one which is stably incorporated into the phage coat at levels less than thoseof the wud 

10 type protein in an identical context. 

A "wild type" sequence or the sequence of a "wild type" protein, such as a coat protem, is 
the reference sequence from which variant polypeptides are derived through me introduce of 
mutations. In general, the "wild type" sequence for a given protein is the sequence that is most 
commoninnature. Similarly, a "wild type" gene sequence is the sequence for that gene whrchis 

15 m0 st commonly found innature. Mutations may be introduced into a "wild type" gene (and thus 
the protein it encodes) either through natural processes or through man induced means. Th. 
products of such processes are "variant" or "mutant" forms ofthe original "wild type" protem or 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
A. Novel methods and cells 

The present invention provides a method of transforming cells by electroporating 
5 competent cells in the presence of heterologous DNA, where the DNA has been purified by DNA 
affinity purification. Preferably, for library construction in bacteria, the DNA is present at a 
concentration of 25 micrograms/mL or greater. Preferably, the DNA is present at a concentration 
of about 30 micrograms/mL or greater, more preferably at a concentration of about 70 
micrograms/mL or greater and even more preferably at a concentration of about 100 

10 micrograms/mL or greater even up to several hundreds of micrograms/mL. Generally, the method 
of the invention will utilize DNA concentrations in the range of about 50 to about 500 
micrograms/mL. It has been discovered that by highly purifying the heterologous DNA, a time 
constant during electroporation greater than 3.0 milliseconds (ms) is possible even when the DNA 
concentration is very high, which results in a high transformation efficiency. Over the DNA 

15 concentration range of about 50 microgram/mL to about 400 microgram/mL, the method of the 
invention allows the use of time constants in the range of about 3.6 to about 4.4 ms using standard 
electroporation instruments. The invention therefore provides a method with greater dynamic 
range in DNA concentration than previously known. 

The high DNA concentrations used in the method of the invention are obtained by highly 

20 purifying DNA used to transform the competent cells. In the method of the invention, the DNA is 
purified to remove contaminants which increase the conductance of the DNA solution used in the 
electroporating process. The DNA may be purified by any known method, however, a preferred 
purification method is the use of DNA affinity purification. The purification of DNA, e.g., 
recombinant linear or plasmid DNA, using DNA binding resins and affinity reagents is well known 

25 and any of the known methods can be used in this invention (Vogelstein, B. and Gillespie, D., 
1979, Proc. Natl Acad. ScL USA, 76:615; Callen, W., 1993, Strategies, 6:52-53). Commercially 
available DNA isolation and purification kits are also available from several sources including 
Stratagene (CLEARCUT Miniprep Kit), and Life Technologies (GLASSMAX DNA Isolation 
Systems). Suitable nonlimiting methods of DNA purification include column chromatography 

30 (U.S. 5,707,812), the use of hydroxylated silical polymers (U.S. 5,693,785), rehydrated silica gel 
(U.S. 4,923,978), boronated silicates (U.S. 5,674,997), modified glass fiber membranes (U.S. 
5,650,506; U.S. 5,438,127), fluorinated adsorbents (U.S. 5,625,054; U.S. 5,438,129), diatomaceous 
earth (U.S. 5,075,430), dialysis (U.S. 4,921,952), gel polymers (U.S. 5,106,966) and the use of 
chaotropic compounds with DNA binding reagents (U.S. 5,234,809). After purification, the DNA 

35 is eluted or otherwise resuspended in water, preferably distilled or deionized water, for use in 
electroporation at the concentrations of the invention. The use of low salt buffer solutions is also 
contemplated where the solution has low electrical conductivity, i.e., is compatible with the use of 
the high DNA concentrations of the invention with time constants greater than about 3.0 ms. 
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My cells which can be transformed by electroporation may be used as host cells in the 
meth od of the present invention. Suitable host cells which can be ~*^»££ m 
DNA in the method of the invention include animal cells (Neumann et al., EMBO J, (1982), 1.841 
Wong and Neumann, Biochem. Biophys. Res. Commu,, (1982), 107:584; Potter et al., Proc. Nat 
5 Acad. «. USA, (1984) 81:7161; Sugden et al., Mol. Cell. Biol. (1985), 5:410; Toneguzzo et al., 
Mol. Cell. Biol., (1986), 6:703; Pur-Kaspa et al., Mol. Cell. Biol., (1986), 6:716), plant ^ 
(Fromm et al., Proc. Natl. Acad. Sci., USA, (1985), 82:5824; From, et al., Nature 1986) 
319 791; Eclcer and Davis, Proc. Natl. Acad. Sci., USA, (1986) 83:5372) andbactenal cells (Chuet 
al Nucleic Acids Res., (1987), 15:1311; Knutson and Yee, Anal. Biochem, (1987) 164:44). 
.0 Prokaryotes are the preferred host cells for this invention. See also Andreason and Evans, 
Biotech, (1988), 6:650 which describes parameters which effect transfection efficren.es for 
varying eel, lines. Suitable bacteria, cells include * coli (Dower et al., above; Taketo, Btoc^ 
BiopHys. Acta, (1988), 149:318), L. casei (Chassy and Flickinger, FEMS Microbiol. Lett ( 987), 
44-173) Strep, lactis (Powell et al., Appl. Environ. Microbiol., (1988), 54:655; Harlander, 
l5 streptolocca, Gene.cs, ed . J. Ferretti and R. Curtiss, III), page 229, American Society or 
Microbiology, Washington, D.C., (1987)), *** tnermopkilus (Somkuti and Sternberg Pro, 4tk 
Eu , Con g . BiotecHnolosy, 1987, 1:412); Campylobacter Jejuni (Miller et al., Proc. Natl. A^ 
Sci USA, (1988) 85:856), and other bacterial strains (Fielder and Wirth, Anal. Biochenu (198 , 
,70-38) including bacilli such as Bacillus subtilis, other enterobacteriaceae such as Salmonella 
20 typnimuriu* or Serratia marcesans, and various species which may all be used « 

L. Suitable Ecoli strains include 1M 101, c.// K12 strain 294 (ATCC numer 31,^ 
coli strain W31 10 (ATCC number 27,325), E. coli X1776 (ATCC number 31,537), E. co XL- 
IBlue (Stratagene), and E. coli B; however many other strains of E. coli, such as XL1-B ue 
MRF' SURE, ABLE C, ABLE K, WM1100, MC1061, HB101, CI136, MV1190, JS4, S5, 
25 NM522, NM538, NM539, TGland many other species and genera of prokaryotes may be used as 

Cells are made competent using known procedure, Sambrook et al., above, 1.76-1.81, 

16 ' 30 ' The heterologous DNA is preferably in the form of a replicable transcription or expression 
30 vector, such as a plasmid, phage or phagemid which can be constructed with relative ease an 
rea dily amplified. Tnese vectors generally contain a promoter, a signal sequence, phenotyptc 
selection genes, origins of replication, and other necessary components which are known to those 
of ordinary skill in this art. Construction of suitable vectors containing these components as weU as 
tte gene encoding one or more desired clone* polypeptides are prepared using stancUrd 
35 ^ombinantDNAproceduresasdescribedinSambrooketal.,above. Isolated DNA fragments to 
be combined to form the vector are cleaved, tailored, and ligated together in a specie order and 
orientation to generate the desired vector. 
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The gene encoding the desired polypeptide (i.e., a peptide or a polypeptide with a rigid 
secondary structure) can be obtained by methods known in the art (see generally, Sambrook et ai). 
If the sequence of the gene is known, the DNA encoding the gene may be chemically synthesized 
(Merrfield, J. Am. Chem. Soc, 85 :2149 (1963)). If the sequence of the gene is not known, or if the 
5 gene has not previously been isolated, it may be cloned from a cDNA library (made from RNA 
obtained from a suitable tissue in which the desired gene is expressed) or from a suitable genomic 
DNA library. The gene is then isolated using an appropriate probe. For cDNA libraries, suitable 
probes include monoclonal or polyclonal antibodies (provided that the cDNA library is an 
expression library), oligonucleotides, and complementary or homologous cDNAs or fragments 
10 thereof. The probes that may be used to isolate the gene of interest from genomic DNA libraries 
include cDNAs or fragments thereof that encode the same or a similar gene, homologous genomic 
DNAs or DNA fragments, and oligonucleotides. Screening the cDNA or genomic library with the 
selected probe is conducted using standard procedures as described in chapters 10-12 of Sambrook 
et al., above. 

15 An alternative means to isolating the gene encoding the protein of interest is to use 

polymerase chain reaction methodology (PCR) as described in section 14 of Sambrook et aL, 
above. This method requires the use of oligonucleotides that will hybridize to the gene of interest; 
thus, at least some of the DNA sequence for this gene must be known in order to generate the 
oligonucleotides. 

20 After the gene has been isolated, it may be inserted into a suitable vector (preferably a 

plasmid) for amplification, as described generally in Sambrook et al. 

The DNA is cleaved using the appropriate restriction enzyme or enzymes in a suitable 
buffer. In general, about 0.2-1 pg of plasmid or DNA fragments is used with about 1-2 units of the 
appropriate restriction enzyme in about 20 |Jl of buffer solution. Appropriate buffers, DNA 

25 concentrations, and incubation times and temperatures are specified by the manufacturers of the 
restriction enzymes. Generally, incubation times of about one or two hours at 37°C are adequate, 
although several enzymes require higher temperatures. After incubation, the enzymes and other 
contaminants are removed by extraction of the digestion solution with a mixture of phenol and 
chloroform, and the DNA is recovered from the aqueous fraction by precipitation with ethanol or 

30 other DNA purification technique. 

To ligate the DNA fragments together to form a functional vector, the ends of the DNA 
fragments must be compatible with each other. In some cases, the ends will be directly compatible 
after endonuclease digestion. However, it may be necessary to first convert the sticky ends 
commonly produced by endonuclease digestion to blunt ends to make them compatible for ligation. 

35 To blunt the ends, the DNA is treated in a suitable buffer for at least 15 minutes at 15 C with 10 
units of the Klenow fragment of DNA polymerase I (Klenow) in the presence of the four 
deoxynucleotide triphosphates. The DNA is then purified by phenol-chloroform extraction and 
ethanol precipitation or other DNA purification technique. 
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He cleaved DNA fragments may be size-separated and selected using DNA gel 
electrophoresis. He DNA may be electrophoresed through either an agarose or a polyacrylannde 
matrix. He selection of the matrix will depend on the size of the DNA fragments to be separated. 
After electrophoresis, the DNA is extracted from the matrix by electroelution, or, if low-meltmg 
S agarose has been used as the matrix, by melting the agan.se and extracting the DNA from U, as 
described in sections 6.30-6.33 of Sambrook et al., supra. 

He DNA fragments that are to be ligated together (previously digested with the 
appropriate restriction enzymes such that the ends of each fragment to be ligated are compile) 
a* put in solution in about equimolar amounts. He solution will also contain ATT, ligase buffer 
and a ligase such as T4 DNA ligase at about 10 units per 0.5 ,g of DNA. If the DNA fragment . 
to be ligated into a vector, the vector is at first linearized by cutting with the appropriate restnctron 
endonuclease(s). He linearized vector is then treated with alkaline phosphatase or calf intestmal 
phosphatase. He phosphatase prevents self-ligation of the vector during the ligatron step^ 

After ligation, the vector with the foreign gene now inserted is purified as descnbed above 
and transformed into a suitable host cell such as those described above by electroporation usmg 
known and commercially available electroporation instruments and the procedures outlined by the 
manufacturers and described generally in Dower et al., above. He invention provides high 
transformation yields, a single electroporation reaction typically yields greater than 1 x 
transformants. However, more than one (a plurality) electroporation may be conducted to rncrease 
the amount of DNA which is transformed into the host ells. Repeated electroporaUons are 
conducted as described in the art. See Vaughan et al., above. He number of 
electroporations may vary as desired from several (2,3,4,...10) up to tens (10, 20, 30....100) and 
even hundreds (100, 200, 300,-1000). Repeated electroporations may be desired to rncrease the 
slze of a combinatorial library, e.g. an antibody library, transformed into the host cells. Wrth a 
plurality of electroporations, it is possible to produce a library having at least 1 .0 x 10 , even 2.0 x 
10", different members (clones, DNA vectors such as phage, phagemids, plasmrds, etc., cells, 
etc ) 

Election ma, be osnied out using methods to™ in me urt sod desoriued, for 
example, „ U.S. 4*0,.40; U.S. 5,186,800; O S. 4,84„55; , U.S. W^* 
„ U.S. 5,422,272; U.S. 5,222,856; U.S. 5,283,194; U.S. 5,128,257; U.S. 5,750,373; U.S. 4,056,288 
or any outer kmown buluh o, eomiuuous elecartporadoo prouess rogedKr with me improvmums of 

the invention. . , 

Typically, electrocompetent cells are mixed with a solution of DNA at the desrred 
concentration at ice temperatures. An aliquot of the mixture is placed into a cuvette and placed m 
35 an electroporation instrument, e.g., GENE PULSER (Biorad) having a typical gap of 0.2 «l 
Each cuvette is electroporated as described by the manufacturer. Typical settings are: voltage = 2.5 
kV resistance = 200 ohms, capacitance = 25 mF. Tbe cuvette is then immediately removed, SOC 
media (Maniatis) is added, and the sample is transferred to a 250 mL baffled flask. He contents of 
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several cuvettes may be combined after electroporation. The culture is then shaken at 37°C to 
culture the transformed cells. 

The transformed cells are generally selected by growth on an antibiotic, commonly 
tetracycline (tet) or ampicillin (amp), to which they are rendered resistant due to the presence of tet 
5 and/or amp resistance genes in the vector. 

After selection of the transformed cells, these cells are grown in culture and the vector 
DNA (plasmid or other vector with the foreign gene inserted) may then be isolated. Vector DNA 
can be isolated using methods known in the art. Two suitable methods are the small scale 
preparation of DNA and the large-scale preparation of DNA as described in sections 1.25-1.33 of 

10 Sambrook et al., supra. The isolated DNA can be purified by methods known in the art such as 
that described in section 1.40 of Sambrook et al., above and as described above.. This purified 
DNA is then analyzed by restriction mapping and/or DNA sequencing. DNA sequencing is 
generally performed by either the method of Messing et al., Nucleic Acids Res., 9:309 (1981) or by 
the method of Maxam et al., Meth. Enzymol., 65:499 (1980). 

15 This invention also contemplates fusing the gene encoding the desired polypeptide (gene 

1) to a second gene (gene 2) such that a fusion protein is generated during transcription. Gene 2 is 
typically a coat protein gene of a filamentous phage, preferably phage M 13 or a related phage, and 
the gene is preferably the coat protein IH gene or the coat protein VIII gene, or a fragment thereof. 
See U.S. 5,750,373; WO 95/34683. Fusion of genes 1 and 2 may be accomplished by inserting 

20 gene 2 into a particular site on a plasmid that contains gene 1, or by inserting gene 1 into a 
particular site on a plasmid that contains gene 2 using the standard techniques described above. 

Alternatively, gene 2 may be a molecular tag for identifying and/or capturing and purifying 
the transcribed fusion protein. For example, gene 2 may encode for Herpes simplex virus 
glycoprotein D (Paborsky et al., 1990, Protein Engineering, 3:547-553) which can be used to 

25 affinity purify the fusion protein through binding to an anti-gD antibody. Gene 2 may also code 
for a polyhistidine, e.g., (his) 6 (Sporeno et al., 1994, J. Biol Chem., 269:10991-10995; Stuber et 
al., 1990, Immunol. Methods, 4:121-152, Waeber et aL, 1993, FEES Letters, 324:109-112), which 
can be used to identify and/or purify the fusion protein through binding to a metal ion (Ni) column 
(QIAEXPRESS Ni-NTA protein Purification System, Quiagen, Inc.). Other affinity tags known in 

30 the art may be used and encoded by gene 2. 

Insertion of a gene into a plasmid requires that the plasmid be cut at the precise location 
that the gene is to be inserted. Thus, there must be a restriction endonuclease site at this location 
(preferably a unique site such that the plasmid will only be cut at a single location during 
restriction endonuclease digestion). The plasmid is digested, phosphatased, and purified as 

35 described above. The gene is then inserted into this linearized plasmid by ligating the two DNAs 
together. Ligation can be accomplished if the ends of the plasmid are compatible with the ends of 
the gene to be inserted. If the restriction enzymes are used to cut the plasmid and isolate the gene 
to be inserted create blunt ends or compatible sticky ends, the DNAs can be ligated together 
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directly using a ligase such as bacteriophage T4 DNA ligase aud incubating the mixture at 1 C for 
1-4 hours in the presenceof ATP and ligase buffer as described in section 1.68 of Sambrook « */ 
above. Iftheendsare not compatible, theymust first bemade blunt by using the Klenow fiagment 
of DNA polymerase 1 or bacteriophage T4 DNA polymerase, both of which recnnre the four 

5 deoxynbonucleoudetripho^^ 

Alternative*, the ends may be blunted using a nuclease such as nuclease SI or mung-bean 
nuclease, both of which function by cutting back the overhanging single strands of DNA. The 
DNA is then religated using a ligase as described above. In some cases, it may not be poss, le to 
blunt the ends of the gene to be mserted, as the reading frame of the coding region will be a tered. 
10 To overcome this problem, oligonucleotide linkers may be used. Tbe linkers serve as a bndge to 
connect the plasmid to the gene to be inserted. These linkers can be made synthet,cally as double 
stra „ded or single stranded DNA using standard methods. The linkers have one end that ,s 
compatible with the ends of the gene to be inserted; the linkers are first ligated to this gene usmg 
ligation methods described above. The other end of the linkers is designed to be companble wUh 
15 tbe plasmid for ligation. In designing the linkers, care must be taken to not destroy Ibe reachng 
frame of the gene to be inserted or the reading frame of the gene contained on the plasmxd. In 
some cases, it may be necessary to design the linkers such that they code for part of an ammo aad, 
or such that they code for one or more amino acids. 

Between gene 1 and gene 2, DNA encoding a termination codon may be mserted, such 
20 termination codons are UAG( amber), UAA (ocher) and UGA (opel). (Microtia^, Davis et al. 
Harper ft Row, New York, 1980, pages 237, 245-47 and 274). Tbe termination codon expressed m 
a wild type host cel. results in the synthesis of the gene 1 protein product without the 
protein attached. However, growth in a suppressor host cell results in tbe synthesrs of detectable 
quantities offbsed protein. Such suppressor host cells contain a tRNA modified to ins^anammo 
25 acid in the termination codon position of the mRNA thereby resulting in product™ of detectaVe 
amounts of tbe fusion protein. Such suppressor host cells are well known and described such as K 
coli suppressor strain (Bullock et a.., BioTecHni.ues 5:376-379 [1987]). Any acceptable method 
may be used to place such a tennination codon into the mRNA encoding the fusion polypeptide. 

The suppressive codon may be inserted between the first gene encoding a polypepUde, 
30 and a second gene encoding at least a portion of a phage coat protein. Alternatively, the 
suppressive termination codon may be inserted adjacent to the fusion site by replacmg me last 
amino acid triplet in tbe poiypeptide or the fust amino acid in the phage coat protem. When *e 
plasmid containing the suppressive codon is grown in a suppressor host cell, * results 
detectable production of a fusion polypeptide containing tbe polypeptide and the coat protem 
35 When the plasmid is grown in a non-suppressor host cel., the polypeptide is synthesized 
substantially without fusion to the phage coat protein due to termination at the mserted 
suppressive triplet encoding UAG, UAA, or UGA. In the non-suppressor cell the polypepUde „ 



WO 00/06717 



PCT/US99/16596 



synthesized and secreted from the host cell due to the absence of the fused phage coat protein 
which otherwise anchored it to the host cell. 

Gene i may encode a mammalian protein, and preferably the protein will be selected from 
human growth hormone(hGH), N-methionyl human growth hormone, bovine growth hormone, 
5 parathyroid hormone, thyroxine, insulin A-chain, insulin B-chain, proinsulin, relaxin A-chain, 
relaxin B-chain, prorelaxtn, glycoprotein hormones such as follicle stimulating hormone(FSH), 
thyroid stimulating hormone(TSH), leutinizing hormone(LH), glycoprotein hormone receptors, 
calcitonin, glucagon, factor VIII, an antibody, lung surfactant, urokinase, streptokinase, human 
tissue-type plasminogen activator (t-PA), bombesin, coagulation cascade factors including factor 

10 VII, factor IX, and factor X, thrombin, hemopoietic growth factor, tumor necrosis factor-alpha and 
-beta, enkephalinase, human serum albumin, mullerian-inhibiting substance, mouse gonadotropin- 
associated peptide, a microbial protein, such as betalactamase, tissue factor protein, inhibin, 
activin, vascular endothelial growth factor (VEGF), receptors for hormones or growth factors; 
integrin, thrombopoietin (TPO), protein A or D, rheumatoid factors, nerve growth factors such as 

15 NGF- alpha, platelet-growth factor, transforming growth factors (TGF) such as TGF-alpha and 
TGF-beta, insulin-like growth factor-I and -II, insulin-like growth factor binding proteins, CD-4, 
DNase, latency associated peptide, erythropoietin (EPO), osteoinductive factors, interferons such 
as interferon-alpha, -beta, and -gamma, colony stimulating factors (CSFs) such as M-CSF, GM- 
CSF, and G-CSF, interleukins (ILs) such as IL-1, IL-2, IL-3, IL-4, IL-6, IL-8, IL-10, IL-12, 

20 superoxide dismutase; decay accelerating factor, viral antigen, HTV envelope proteins such as 
GP120, GP140, atrial natriuretic peptides A, B, or C, immunoglobulins, as well as variants and 
fragments of any of the above-listed proteins. 

The first gene may encode a peptide containing as few as 4-10 amino acid residues and up 
to about 50 -80 residues. These smaller peptides are useful in determining the antigenic properties 

25 of the peptides, in mapping the antigenic sites of proteins, etc. The first gene may also encode a 
polypeptide of one or more subunits containing more than about 100 amino acid residues which 
may be folded to form a plurality of rigid secondary structures displaying a plurality of amino acids 
capable of interacting with the target. Preferably the first gene will be mutated at codons 
corresponding to only the amino acids capable of interacting with the target so that the integrity of 

30 the rigid secondary structures will be preserved. 

Phage display of proteins, peptides and mutated variants thereof, including constructing a 
family of variant replicable vectors containing a transcription regulatory element operably linked to 
a gene fusion encoding a fusion polypeptide, transforming suitable host cells, culturmg the 
transformed cells to form phage particles which display the fusion polypeptide on the surface of the 

35 phage particle, contacting the recombinant phage particles with a target molecule so that at least a 
portion of the particle bind to the target, separating the particles which bind from those that do not 
are known and may be used with the transformation method of the invention. See U.S. 5,750,373; 
WO 97/09446; U.S. 5,514,548; U.S. 5,498,538; U.S. 5,516,637; U.S. 5,432,018; WO 96/22393; 
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US 5 658727- US. 5,627,024; WO 97/29.85; OHo* « A 1997, Kn*** 236;«8.547; 
U.S. 5,668,72/, U.6. 3, , t . 7 ., 7S igo- O'Neil and Hoess, 1995, Curr.Opm. 

So ral iUioae< a l,1994,^.fii~te«««» rf -. 47 " S l' 0 . 0 " 

W AM 5 443-449; Makowski, 1993. Gene, 128:5-11; Dunn, 1996. On. Opm. Sm«*. »*, 
^ST- Kln8, „95, — «. - «^ 

2*. o— - *. <*» aw - ,;61W21; Mcl ^ 
l0 ** TZ^y — - ' — - * ~ 7C 

.u fmrhFnh F(ab')> Fv diabodies, linear antibodies, etc. Gene 1 
«f an antibodv or fragments thereof, such fab,i-taD;2,rv,uia 

of an antibody tragm ^ ^ e preparation of libraries of antibodies or 

published. Hoogenboom, 1997, ™«*> \ tf ^ 1992 , /mmMBO , 

Winter et aL, 1994, Annu. Rev. Immunol., 12.433 
130-109-124; Jefferies, 1998, Parasitology, 14:202-206. 

hematopoietic growth factor snperfamily receptors and preferably the extrace 
hematopoietic gro glycoprote in cell surface receptors that share 

35 thereof, which are a group of closely dassified as 

considerable homology including frequently a WSXWS domam an g 
members of the cytokine rector superfamily (see e.g. Nicola et aL, Cell, 67.1-4 (1991 and 
rr R .C - aLmO , 12:2645-2653 (199,). Generally, these targets are receptors for 

fry 
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interleukins (IL) or colony-stimulating factors (CSF). Members of the superfamily include, but are 
not limited to, receptors for: IL-2 (b and g chains) (Hatakeyama et al., Science, 244:551-556 

(1989) ; Takeshita et al., Science, 257:379-382 (1991)), IL-3 (Itoh et al, Science, 247:324-328 

(1990) ; Gorman et al, Proc. Natl. Acad. Sci USA, 87:5459-5463 (1990); Kitamura et al., Cell, 
5 66:1 165-1174 (1991a); Kitamura et al., Proc. Natl. Acad. Sci. USA, 88:5082-5086 (1991b)), IL-4 

(Mosley et al., Cell, 59:335-348 (1989), IL-5 (Takaki et al., EMBO J., 9:4367-4374 (1990); 
Tavernier et al., Cell, 66:1 175-1184 (1991)), IL-6 (Yamasaki et al, Science, 241:825-828 (1988); 
Hibi et al, Cell, 63:1149-1157 (1990)), IL-7 (Goodwin et al, Cell, 60:941-951 (1990)), IL-9 
(Renault et al, Proc. Natl. Acad. Sci. USA, 89:5690-5694 (1992)), granulocyte-macrophage 

10 colony-stimulating factor (GM-CSF) (Gearing et al, EMBO J., 8:3667-3676 (1991); Hayashida et 
al, Proc. Natl. Acad. Sci. USA, 244:9655-9659 (1990)), granulocyte colony-stimulating factor (G- 
CSF) (Fukunaga et al, Cell, 61:341-350 (1990a); Fukunaga et al, Proc. Natl Acad. Sci. USA, 
87:8702-8706 (1990b); Larsen et al, J. Exp. Med., 172:1559-1570 (1990)), EPO (D'Andrea et al, 
Cell, 57:277-285 (1989); Jones et al, Blood, 76:31-35 (1990)), Leukemia inhibitory factor (LIF) 

15 (Gearing et al, EMBO J., 10:2839-2848 (1991)), oncostatin M (OSM) (Rose et al, Proc. Natl. 
Acad. Sci. USA, 88:8641-8645 (1991)) and also receptors for prolactin (Boutin et al, Proc. Natl. 
Acad. Sci. USA, 88:7744-7748 (1988); Edery et al, Proc. Natl Acad. Sci. USA, 86:2112-2116 
(1989)), growth hormone (GH) (Leung et al, Nature, 330:537-543 (1987)), ciliary neurotrophic 
factor (CNTF) (Davis et al, Science, 253:59-63 (1991) and c-Mpl (M Souyri et al, Cell 63:1137 

20 (1990); I. Vigon et al, Proc. Natl Acad Sci. 89:5640 (1992)). Still other targets for antibodies 
made by the invention are erb2, erb3, erb4, IL-10, IL-12, IL-13, IL-15, etc. 

Gene 1 , encoding the desired polypeptide, may be altered at one or more selected codons. 
An alteration is defined as a substitution, deletion, or insertion of one or more codons in the gene 
encoding the polypeptide that results in a change in the amino acid sequence of the polypeptide as 
25 compared with the unaltered or native sequence of the same polypeptide. Preferably, the 
alterations will be by substitution of at least one amino acid with any other amino acid in one or 
more regions of the molecule. The alterations may be produced by a variety of methods known in 
the art. These methods include but are not limited to oligonucleotide-mediated mutagenesis and 
cassette mutagenesis. 

30 Oligonucleotide -mediated mutagenesis is preferred method for preparing substitution, 

deletion, and insertion variants of gene 1. This technique is well known in the art as described by 
Zoller et al., Nucleic Acids Res., 10: 6487-6504 (1987). Briefly, gene 1 is altered by hybridizing an 
oligonucleotide encoding the desired mutation to a DNA template, where the template is the single- 
stranded form of the plasmid containing the unaltered or native DNA sequence of gene 1 . After 

35 hybridization, a DNA polymerase is used to synthesize an entire second complementary strand of 
the template will thus incorporate the oligonucleotide primer, and will code for the selected 
alteration in gene 1. 
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homoduplex molecule can then be transformed into a suitable host cell such as E. coli JM101, as 
described above. 

Mutants with more than one amino acid to be substituted may be generated in one of 
several ways. If the amino acids are located close together in the polypeptide chain, they may be 
5 mutated simultaneously using one oligonucleotide that codes for all of the desired amino acid 
substitutions. If, however, the amino acids are located some distance from each other (separated 
by more than about ten amino acids), it is more difficult to generate a single oligonucleotide that 
encodes all of the desired changes. Instead, one of two alternative methods may be employed. 

In the first method, a separate oligonucleotide is generated for each amino acid to be 

10 substituted. The oligonucleotides are then annealed to the single-stranded template DNA 
simultaneously, and the second strand of DNA that is synthesized from the template will encode all 
of the desired amino acid substitutions. The alternative method involves two or more rounds of 
mutagenesis to produce the desired mutant. The first round is as described for the single mutants: 
wild-type DNA is used for the template, an oligonucleotide encoding the first desired amino acid 

15 substitution(s) is annealed to this template, and the heteroduplex DNA molecule is then generated. 
The second round of mutagenesis utilizes the mutated DNA produced in the first round of 
mutagenesis as the template. Thus, this template already contains one or more mutations. The 
oligonucleotide encoding the additional desired amino acid substitution(s) is then annealed to this 
template, and the resulting strand of DNA now encodes mutations from both the first and second 

20 rounds of mutagenesis. This resultant DNA can be used as a template in a third round of 
mutagenesis, and so on. 

Cassette mutagenesis is also a preferred method for preparing substitution, deletion, and 
insertion variants of gene I. The method is based on that described by Wells et al, Gene, 34:315 
(1985). The starting material is the plasmid (or other vector) comprising gene I, the gene to be 

25 mutated. The codon(s) in gene 1 to be mutated are identified. There must be a unique restriction 
endonuclease site on each side of the identified mutation site(s). If no such restriction sites exist, 
they may be generated using the above-described oligonucleotide-mediated mutagenesis method to 
introduce them at appropriate locations in gene 1 . After the restriction sites have been introduced 
into the plasmid, the plasmid is cut at these sites to linearize it. A double-stranded oligonucleotide 

30 encoding the sequence of the DNA between the restriction sites but containing the desired 
mutation(s) is synthesized using standard procedures. The two strands are synthesized separately 
and then hybridized together using standard techniques. This double-stranded oligonucleotide is 
referred to as the cassette. This cassette is designed to have 3 1 and 5' ends that are compatible with 
the ends of the linearized plasmid, such that it can be directly ligated to the plasmid. This plasmid 

35 now contains the mutated DNA sequence of gene 1 . 

In a preferred embodiment, gene 1 is linked to gene 2 encoding at least a portion of a 
phage coat protein. Preferred coat protein genes are the genes encoding coat protein 3 and coat 
protein 8 of filamentous phage specific for E. coli, such as Mi 3, fl and fd phage. Transfection of 

31 
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cells. In general, mixing cultures of the two cell types and growing the mixture in culture medium 
for about one hour at 37°C is sufficient to allow mating and episome transfer to occur. The new 
resulting E. coli strain has the genotype of MCI 061 which carries a streptomycin resistance 
chromosomal marker and the genotype of the F plasmid which confers tetracycline resistance. 
5 The progeny of this mating is resistant to both antibiotics and can be selectively grown in the 
presence of streptomycin and tetracycline. Strain SS320 has been deposited with the American 
Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Virginia, USA on June 
18, 1998 and assigned Deposit Accession No. 98795. 

This deposit of strain SS320 was made under the provisions of the Budapest Treaty on the 

10 International Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure 
and the Regulations thereunder (Budapest Treaty). This assures maintenance of a viable culture for 
30 years from the date of deposit. The organisms will be made available by ATCC under the terms 
of the Budapest Treaty, and subject to an agreement between Genentech, Inc. and ATCC, which 
assures permanent and unrestricted availability of the progeny of the cultures to the public upon 

1 5 issuance of the pertinent U.S. patent or upon laying open to the public of any U.S. or foreign patent 
application, whichever comes first, and assures availability of the progeny to one determined by 
the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 USC §122 
and the Commissioner's rules pursuant thereto (including 37 CFR §1.14 with particular reference to 
886 OG 638). 

20 The assignee of the present application has agreed that if the cultures on deposit should die 

or be lost or destroyed when cultivated under suitable conditions, they will be promptly replaced 
on notification with a viable specimen of the same culture. Availability of the deposited cultures is 
not to be construed as a license to practice the invention in contravention of the rights granted 
under the authority of any government in accordance with its patent laws. 

25 SS320 cells have properties which are particularly favorable for electroporation. It has 

been discovered that SS320 cells are particularly robust and are able to survive multiple washing 
steps with higher cell viability than most other electroporation competent cells. The ability of 
SS320 cells to survive washing steps allows one to prepare cell concentrations which are greater 
than the cell concentrations suggested by the Dower et al. protocol. Other strains suitable for use 

30 with the higher cell concentrations include TBI, MC1061, etc. These higher cell concentrations 
provide greater transformation efficiency for the process of the invention. 

The use of higher DNA concentrations during electroporation (about 10X) increases the 
transformation efficiency and increases the amount of DNA transformed into the host cells. The 
use of higher cell concentrations also increases the efficiency (about 10X). The larger amount of 

35 transferred DNA produces larger libraries having greater diversity and representing a greater 
number of unique members of a combinatorial library. The method of the invention is useful to 
increase the size of an expressible combinatorial library by a factor of about 100X with a single 
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election event which allows one to select, amplify and identify rare library members present 
in amounts 100X lower than with comparable conventional methods. 

Dower et al. have demonstrated that saturation (transformation of most survivors of an 
electroporation) can be achieved with closed circular DNA concentrations of about 10 
5 microgram/mL. However, the construction of libraries, for example a library of toon genes 
encoding fusion polypeptides, necessarily involves the introduction of DNA fragments 
representing the library into a suitable vector to prov.de a family or library of vectors. In the case 
of cassette mutagenesis, the synthetic DNA is a double stranded cassette while m fiU-m 
mutagenesis the synthetic DNA is single stranded DNA. In either case, the syntheUc DNA «. 
,0 incorporated into a vector to yield a reaction product containing closed circular double stranded 
DNA which can be transformed into a cell to produce the library. However, processes used to 
incorporate syntheUc DNA into closed circular DNA are generally less than 100% efficient, and 
often the desired closed circular product represents only a small fraction of the total DNA. To 
achieve saturation with a ligation or fill-in reaction product may retire significantly higher DNA 
15 concentrations than those necessary when using pure closed circular DNA. Prior methods do not 
allow or suggest using DNA concentrations which are sufficient to achieve saturate m the 
decoration reactions using DNA incorporating synthetic DNA fragments into transformation 
vectors, e.g. plasmids, phage vectors, phagemid vectors, etc. 

For example, the reaction of Example 6 below, demonstrates that using the method of the 
20 invention, a DNA concentration of 19 microgram/mL results in transformation of 53% of : surviving 
cells in this example, the fill-in reaction ws very efficient yielding about 95% of the desued 
dosed circular DNA product as evidenced by agarose gel electrophoresis. The DNA concentration 
can be increased to hundreds (e.g., 300 - 500 microgram/mL) of micrograms/mL without 
adversely affecting either the cell survival or the transformation. The maximum number of 
25 transforms is obtained at a DNA concentration 20-fold lower than the maxtmum DNA 
concentration tested and even the largest DNA concentration tested had no detrimental effect on 
the electroporation reaction. The invention provides a useable dynamic range of DNA 
concentration in electroporation which far exceeds that possible with prior methods. Usmg the 
method of the invention, inefficient transformation vector formation reactions which y,eld only 
30 small amounts of the desired clonable DNA, for example only 10-50%, or even 1-10% can be 
made to saturate the electroporation survivors by using DNA concentrations up to about 400 -500 

micrograms/mL. . 

The method of the invention also allows the facile introduction of two or more vectors mto 
asingle cell, even with a DNA reaction preparation in which the desired vector (e.g. closed cncular 
35 DNA) represents only a fraction of the total DNA in the preparation. This makes possible the 
simultaneous introduction of multiple foreign genes in separate vectors, for example, two or more 
libraries into a single transformant. The introduction multiple members of a library into a smgle 
cell expands the library diversity beyond the number of transforms For example, if on average 
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a transformant in an electroporation reaction maintains two plasmids, the library diversity will be 
twice the number of transformants. 



concentration beyond which further increases in DNA concentration do not result in increased 
5 transformation yields. 

At the saturation concentration, all cells capable of taking up and maintaining DNA have 
been transformed. Dower et aL, above, have shown that transformation efficiency is directly 
proportional to DNA concentration. An increase in DNA concentration should result in 
corresponding increases in the average number of unique plasmids per survivor. For example, if at 

10 a given DNA concentration, electroporation results in survivors carrying one unique plasmid each 
on average, then doubling the DNA concentration will result in the survivors carrying two unique 
plasmids each on average. 

The current invention allows for the use of DNA concentrations in electroporation at least 
an order of magnitude above that possible with prior art methods. This in turn allows for the 

15 simultaneous introduction of two or more unique plasmids (containing different library members) 
into a single cell during a single electroporation reaction, even with DNA preparations which have 
been enzymatically manipulated. 

Several members of a single library can be introduced into a single cell and thus the library 
diversity can be expanded beyond the number of transformants. For example, if on average a 

20 transformant produced in a given electroporation reaction maintains two plasmids, the library 
diversity will be twice the number of transformants. In the case of phage display libraries, 
packaging of phage DNA or phagemids maintained within the same transformant will result in a 
random display of the different fusion proteins produced from the different phage DNA or 
phagemids. Thus, some of the fusion proteins will be displayed in association with their cognate 

25 DNA sequences while others will be associated with completely unrelated sequences which 
happened to co-transform by random chance. 

In the case of highly polyvalent display (e.g., peptide display on protein-8) where the 
number of incorporated fusion proteins per phage particle greatly exceeds the number of co- 
transformed phage DNA or phagemids, each phage particle will display its cognate fusion protein 

30 along with other unrelated fusion proteins. This will result in an increase in background during the 
first round of sorting due to the capture of fusion proteins associated with phage containing 
unrelated DNA. If the phage captured in the first round are amplified by infection into E. coli at a 
low multiplicity of infection so that each E. coli cell is infected by only one phage, the correlation 
between displayed fusion protein and its DNA sequence will be restored. The second round of 

35 sorting will then eliminate DNA sequences which do not encode proteins with affinity for the 
target. 

In the case of monovalent display (e.g., protein display on protein-3) where the ratio of 
fusion protein to phage is less than one, each displayed fusion protein may associate either with its 



The saturation concentration for a given DNA preparation can be defined as the 
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round as described above. 

The use of DNA concentrations far above the saturation concentration also allows for. fre 
co-transformation of a cell with plasmids from different libraries in a single electroporatio. Thus 
the current invention can be used to facilitate and simplify any methodology which r^urres that a 
single cell maintain two or more plasmids from distinct libraries, since two or more hbranes can be 
introduced simultaneously rather than serially. 

For example, Griffiths et al. { EMBO Journal 13(14):3245-3260, 1994) used the process of 
c^binatorial infection and in +o recombination to increase the size of a phage anfib^y 
repertoire. Tbe process involved the separate electroporation of Ught chain and heavy cham 
repertoires to produce two distinct libraries; a fthd step combined the two Hbranes to y.Uth 
desired antibody library. Tne DNA concentrations achievable with the current mvenUon enables 
the co-transformation of the two libraries in a single electroporation. 

The transformed cells are generally selected by growth on an antibioUc, commonly 
tetracycline (tet) or ampicillin (amp), to which they are rendered resistant due to the presence of tet 
and/or amp resistance genes in the vector. 
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After selection of the transformed cells, these cells are grown in culture and the vector 
DNA may then be isolated. Phage or phagemid vector DNA can be isolated using methods known 
in the art, for example, as described in Sambrook et al., Molecular Cloning: A Laboratory Manual, 
2nd edition, (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. 
5 The isolated DNA can be purified by methods known in the art such as that described in section 
1 .40 of Sambrook et al., above and as described above. This purified DNA can then be analyzed 
by DNA sequencing. DNA sequencing may be performed by the method of Messing et al., Nucleic 
Acids Res., 9:309 (1981), the method of Maxam et al., Meth. EnzymoL, 65:499 (1980), or by any 
other known method. 

10 The invention also contemplates producing product polypeptides which have been 

obtained by culturing a host cell transformed with a replicable expression vector, where the 
replicable expression vector contains DNA encoding a product polypeptide operably linked to a 
control sequence capable of effecting expression of the product polypeptide in the host cell; where 
the DNA encoding the product polypeptide has been obtained by: 

15 (a) constructing a family of variant replicable plasmids containing a transcription 

regulatory element operably linked to a gene fusion encoding a fusion protein, wherein the gene 
fusion contains a first gene encoding a polypeptide and a second gene encoding at least a portion of 
a phage coat protein, where the variant replicable plasmids contain variant first genes encoding 
variant polypeptides; 

20 (b) transforming suitable host cells with the plasmids using the method of the invention; 

(c) optionally, when the plasmid is a phagemid which requires a helper phage to produce 
phage particles, infecting the transformed host cells with an amount of helper phage encoding the 
phage coat protein sufficient to produce recombinant phagemid particles which display the fusion 
protein on the surface of the particles, preferably where no more than a minor amount of the 

25 phagemid particles display one or more copies of the fusion protein on the surface of the phagemid 
particles; 

(d) culturing the transformed infected host cells under conditions suitable for forming 
recombinant phage particles containing at least a portion of the plasmid and capable of 
transforming the host cells; 

30 (e) contacting the recombinant phage particles with a target molecule so that at least a 

portion of the phage particles bind to the target molecule; 

(f) separating phage particles that bind to the target molecule from those that do not bind; 

(g) selecting one of the variant polypeptides encoded in a phage particle which binds or 
does not bind to the target molecule as the product polypeptide and cloning DNA encoding the 

35 product polypeptide into the replicable expression vector; and recovering the expressed product 
polypeptide; and product polypeptides produced by the process. 

U.S. 5,750,373 describes generally how to produce and recover a product polypeptide by 
culturing a host cell transformed with a replicable expression vector (e.g., a phagemid) where the 
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The structure and function of the major coat protein of filamentous bacteriophage have 
been studied in order to understand the interactions between phage DNA and the coat proteins, as 
well as to understand the forces which effect packing of the coat proteins into bacteriophage 
particles. Point mutations in the major coat proteins of filamentous bacteriophage have been 
5 prepared to assist in these studies. Hunter, E.J. et al., (1987) Nature, 327:252; Greenwood, J. et al., 
(1991) J. Mol. Biol 217: 223; Deber, CM. et al., (1993) Proc. Natl Acad. Sci. USA, 90: 1 1648; 
Symmons, M.S. et al., (1995) J. Mol. Biol. 245:86; Williams, K.A. et al., (1995) J. Mol Biol 
252:6; Spruijt, R. B. et al., (1996) Biochemistry 35: 10383; Marvin, D.A. (1998) Current Opinion in 
Structural Biology 8:150; Haigh, N.G. and Webster, R.E., (1998) J. Mol. Biol, 279:19. These 

10 studies suggest that some point mutations are tolerated by phage and result in packaging of phage 
particles containing the mutant major coat proteins. None of these studies involve fusion proteins 
of heterologeous polypeptides to variant phage coat proteins, however. Furthermore, it is known 
that the inclusion of fusion proteins in a phage coat may hinder phage packaging giving rise to poor 
phage yields and/or may prevent display of the fusion protein on the surface of the phage even 

1 5 when a wild type coat protein sequence is used (Smith, G. P. ( 1 985), Science, 228: 1 3 1 5). Whereas 
small peptides (10 - 15 amino acid residues) can generally be displayed in up to about 800 - 1000 
copies per virion, full length proteins are displayed in many fewer numbers (1-10 copies per 
virion). Malik, P. et al. (1996) J. Mol. Biol. 260:9. 

The sequences of several known mature major coat proteins of filamentous bacteriophage 

20 aligned with the mature M13 coat protein VHI are shown in the Table below. Segments of the coat 
proteins were aligned with Ml 3 protein VIII so as to provide maximum identity with the Ml 3 
protein without the introduction of any deletions or insertions. Numbering above the sequences 
refers to the residues of mature M 13 protein VUt. Protein sequences are taken from the Dayhoff 
protein database (accession numbers: M13, COAB_BPFD; Fl, COAB_BPFD; Fd, COAB_BPFD; 

25 Zj-2, COAB_BPZJ2; If-1, COAT_BPIFl ; 12-2, COAB_BPI22; Ike, COAB_BPIKE). Homologous 
residues are indicated with dashes. A sequence having a single deletion is also known (WO 
92/18619). It can be seen that there is considerable homology among the sequences of these coat 
proteins, particularly among the M13, fl, fd and Zj-2 coat proteins and among the Ifl , 122 and Ike 
coat proteins. 
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Filamentous phage particle are formed when the phage genes are transcribed, translated 
and replicated in a host cell. Phage coat proteins are directed to the periplasm and temporarily 
lodge in the cell membrane with a portion of the coat protein in the periplasm (the periplasmic 
5 domain), a portion of the coat protein in the cytoplasm (the cytoplasmic domain), and a portion of 
the coat protein spanning the cell membrane (the transmembrane domain). Phage particles are 
formed when the coat proteins assemble around the phage DNA as the phage particle passes 
through the cell membrane. The Ml 3 major coat protein contains 50 residues which can be 
divided into three regions: The periplasmic domain contains residues 1 to 20, the transmembrane 

10 domain contains residues 21 to 39, and the cytoplasmic domain contains residues 40 to 50 (Marvin, 
D.A. (1998) Current Opinion in Structural Biology 8: 150). The other major coat proteins in the 
table above have a similar domain structure. 

Surprisingly, applicants have also discovered that fusion proteins of heterologeous 
polypeptides to variants of the major coat proteins of bacteriophage are well tolerated in phage 

15 display systems. This result was unexpected since previous phage display systems have used the 
wild type coat protein sequences, generally of M 13 or fragments thereof, T4, T7 or lambda phage 
coat proteins. In one aspect of the present invention, phage display and selection have been used to 
obtain bacteriophage displaying fusion proteins on the surface thereof where the fusion protein is a 
heterologeous polypeptide fused to a phage major coat protein variant having one or more amino 

20 acid substitutions, deletions or additions. Fusion proteins having a heterologous polypeptide linked 
to a variant of the coat proteins in the Table above are within the scope of this invention. 

Preferred variants of Ml 3, fl and fd coat protein VIII contain at least one amino acid 
residue selected from the lists below in the position indicated: 

25 Residue Number Sample Substitutions Preferred Amino Acids 



1 


E, L, V, Q (neg. charged, hydrophobic, polar) 


D,I,N 


2 


R, H, F, W, E (charged, large aromatic) 


K,Y,D 


3 


T, E, L (small polar, neg. charged, hydrophobic) 


S,D,I,V,A 


4 


D, R, H (charged) 


E, K 


5 


R, H, N, D (charged, polar) 


K.Q.E 


6 


Y, W, S, I, L (aromatic, small polar, hydrophobic) 


F, T, V 


7 


T, N (small polar) 


S 


8 


D, H (charged) 


R,E,K 


9 


E, Q, T (neg. charged, polar) 


D,N,S 


11 


W, I, V (aromatic, hydrophobic) 


Y.L.F 


12 


R, H, N (charged, polar) 


E, D, K, Q 


13 


I, L, E, Q (neg. charged, polar) 


A, V, D.T.N.S 


14 


L (hydrophobic) 


I.V 
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, x E,K,H,Q 
15 D, R, N (charged, polar) 

E,V,L,T (neg. charged, hydrophobic, polar) D.I.A.S.G 

E V L (neg. charged, hydrophobic, polar) I. A> T « D 

. . . V, I 

lg L (hydrophobic) 

L.T.Q.E (hydrophobic, polar, neg. charged) I,V,S,A,N,D 
R.D.H (charged) N,Q,K,E 

L F V 

W Y I (aromatic, hydrophobic) » » 

F, Y 

22 W (aromatic) 

W Y I V, H, K (aromatic, hydrophobic, pos. charged) F, L, R 

1 ' ' L N V 

10 2 4 I, Q (hydrophobic, polar) ' ' 

25 S (hydrophobic, polar) 1 L ' 

0fx A, I, V (hydrophobic) Gy Li M 

T S 

97 N (small polar) 

V 

28 I, L (hydrophobic) 

K, R, F, W (aromatic, pos. charged) H > Y 



16 
17 



19 
20 
21 



15 29 



20 



25 



30 



35 



30 I, V (hydrophobic) 

In these tables, .he letter code refers to amino -^"^ A ^*^"^ 
asparagine or aspartic acid; C (Cys) cysteine; D (Asp) aspartic acid; E (glu) glutanuc ac.d; F (Phe) 
phenylalanine; G (Gly) glycine; H (His) histidine; I (He) isoleucine; K (Lys) lysine L (Leu) 
Line; M (Met) methionine; N (Asn) asparagrne; O (Xaa) stop codon; P (Pro) prohne; Q (Gin 

X (Xaa) unknown or non-standard; Y (Tyr) tyrosine; Z (Glx) gluamine or glutamic actd. 

As a part of this invention, it has been discovered that the amino acid sequence of phage 

ascomponentsofmsionproteinsinphagedisplaysystemsandmethods. Fusion protems 
eontainingvariantsofthemajor coat protein of a bacteriophage influence the abthty of phage to 
package the fusion proteins into complete virus particles (virions). That is, variants of the major 

Hy^er-fun^ 

proteins incorporated into a virus particle. Conversely, hypo-functional variants can be used to 
decrease fusion protein incorporation. In this way, the present invention provides a method for 

tailoring me incorpo^^ .„ . 

valency. This is particularly important for fusion proteins in whichtheheterologouspolypept.de , 
relatively large, for example, where me heterologous polypeptide contains 50 or more ammo 
prefembly 100 or more ammo acids, and even more preferably 200 or more amino acrd resxdues 
and also where the heterologous polypeptide is a protem havmg secondary and tertiary stmcture. 
The method of the invention, therefore, provides a means of overcoming the defictencres of pnor 

41 
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art phage display methods which utilize the major coat protein of a bacteriophage and which 
generally obtain only limited incorporation of the fusion protein into the virus coat. The fusion 
polypeptides of the invention are able to function in known phage display systems by substituting 
for the conventionally used wild type coat protein fusions with heterologous polypeptides. The 
fusion polypeptides of the invention will function in a similar manner to conventional fusion 
proteins in each of the known phage display systems, in which the fusion is with the major coat 
protein of the virus, further allowing one to select the degree of valency or number of fusion 
proteins displayed on the surface of the phage with more reliability. For example, the phage and 
phagemid vectors and the phage display systems described in U.S. 5,223,409; U.S. 5,403,484; U.S. 
5,571,689; U.S. 5,750,373, and U.S. 5,780,279 (and others noted above) can be modified to use the 
fusion proteins of the invention to improve display of peptides, proteins, antibodies and fragments 
thereof on the surface of phage. The phage is preferably a DNA phage. 

In addition to filamentous phage, the invention is suitable for use in phage display systems 
using lambda phage, Baculovirus, T4 phage and T7 phage. In each of these display systems, the 
coat protein used to display a heterologous polypeptide is mutated to form variants of the coat 
protein using the method of the invention and variants having the desired degree of display (hyper- 
functional or hypo-functional variants) are selected. The selected variant coat protein is then used 
to form a fusion protein with a heterologous polypeptide which is to be displayed on the surface of 
the virus particles. The scope of this invention includes the method(s) of the invention using these 
phage as well as fusion proteins, repiicable expression vectors containing a gene encoding the 
fusion protein, virus particles containing the fusion proteins or vectors, host cells containing the 
virus particles, fusion proteins or vectors, libraries containing a plurality of different individuals of 
these fusion proteins, vectors, virions, cells, etc. 

Polypeptides may be displayed on lambdoid phage using coat proteins in either the head or 
the tail portions of the phage particle (U.S. 5,627,024). Suitable head proteins include proteins pE, 
pD, pB, pW, pFII, pB* (a cleavage product of pB), pXI, and pX.2; suitable tail proteins include pJ, 
pV, pG, pM, and pT. The structure and location of these coat proteins is well known. See 
Georgeopoulos, et al. and Katsura in "Lambda II", R. W. Hendrix et al. eds. Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y., 1983. Preferred lambda proteins for use in the invention 
are the tail coat proteins, particularly pV. U.S. 5,627,024 describes how to display polypeptides on 
lambda phage, preferably using pV. The fusion proteins of the invention, therefore, include at least 
a portion of variants of pE, pD, pB, pW, pFII, pB*, pXI, pX.2, pJ, pV, pG, pM, and pT fused to a 
heterologous polypeptide. 

Polypeptides can also be displayed on T4 phage. The structure of the T4 virion is well 
studied. See Eiserling in "Bacteriophage T4", C. K. Mathews et al. eds. American Society for 
Microbiology, Washington, D.C., 1983, pp 1 1-24. Peptides and full length proteins may be 
displayed as fusions with the SOC (small outer capsid protein) and the HOC (highly antigenic 
outer capsid protein) coat proteins of T4 phage. Further, the minor T4 fibrous protein fibritin 
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SeeRen.Z-J.et 31(1998)^215^^ 

128:44380; Ren, Z-J. et al. (1997) CAN 127:215644; Ren, Z-J. (1996) Protein So, 5:1833; and 
5 Efimov, V. P. et al. (1995) Virus Genes 10: 173. 

T7 phage may also be nsed to display polypeptides and proteins. Smith, G. P. and Scott, 
j K (1993) Methods in Enzymology, 217, 228-257; U.S. 5,766,905. Commercia! kits (T7Seleotl- 
1 and T7Select415-l from Novagen) are available for display of polypeptides as fusion protems 
with the lOBcapsidprotein (397 amino acids)and with the lOAcapsid protein (344 ammo acids). 

in size m nighcopynumber(415perphage),andprotemsup to about 1200 amino acuismlow 
copy number( 0 .l-lperphage).T7i S adoublestrandedDNAphage that has been extensively 
studied (Dunn, J.J. and Studier, F.W. (1983) J. MoL Biol. 166:477-535; Steven, A.C. and Tru , 
B L (1986) Electron Microscopy of Proteins 5 : 1-35). Phage assembly takes place mside the host 
15 (E colOcellandmaturephagearereleasedbycelllysis. Fusion proteins of heterologous 
polypeptides to variants of 17 coat proteins, such as 10B and 10A, vectors containing a gene 
lodingthe fusionprotein, etc. are within the of the invenUon. Preferably, fhs.onprotems are 
prepared by altering, preferably by mutating to a non-wild type amino acid, one or more of 

residues 1-348 of capsid protein 10B. 

The invention also includes fusion proteins of heterologous polypeptides wrth Bacu ovtrus 

coat protein variants. Baculovirus expression vectors, particularly those based on Autographa 

californica nuclear polyhedrosis virus, are easily generated and are now widely used forthe 

expressionofhetero^^^^ 

Possee R D (1991) J. Gen. Virol. 72:2967). These viruses contain a double stranded, circular 

Univ 40;45. It is possible to display a glycosylated eukaryotic protein on the surface of 
baculovirus particles, using a fusion with the baculovirus major coat protein gp64 or at least by 
fusing the heterologous polypeptides to the membrane anchorage domain of gp64 only. The 
efficiency of vanous promoters (polyhedrin, basic, ^promoter) have been exammed, mcludmg 
30 the "very late" polyhedrin promoter and the "early and late" gp64 promoter. In order to express a 
foreign gene on the surface of baculoviruses efficiently, it is necessary to choose a regulating 

hand start transcription early enough in the viral replication cycle, to guarantee efficient packaging, 
complete glycosylation and correct folding. 

As a further aspect of the invention, it has been discovered that phage display technology 
can be applied to themajor coat protein itself to generate useful major coat protein variants and 

constructed where the expression vector includes a transcription regulatory element operably 

4t 
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linked to a gene fusion encoding a fusion protein where the gene fusion contains a first gene 
encoding a first polypeptide and a second gene encoding a variant of the major coat protein of the 
bacteriophage used in the phage display system. That is, a library is constructed in which the 
second gene encodes for a plurality of variant phage major coat proteins and the phage display 
system or method is used to select for the variant sequence or sequences which give the desired 
degree of fusion protein surface display of a polypeptide. The degree of randomization of the 
major coat protein to produce the variants is optional. That is, libraries may be constructed in 
which each amino acid residue in the coat protein may be randomized to any amino acid or each 
residue may be limited to a subset of amino acids to produce a more limited library having 
predetermined constraints. It is also possible to construct a library in which a subset of residues are 
allowed to vary within a subset of amino acids, Le. selected residues are incompletely randomized. 
For example, it is possible to limit the range of variants for a particular amino acid residue to polar 
amino acids, hydrophobic amino acids, hydrophilic amino acids, aromatic amino acids, positively 
or negatively charged amino acids, sterically small or large amino acids, or to a particular desired 
combination of amino acids to obtain a smaller library having particular constraints. Any 
combination of amino acids may be used to prepare the desired libraries. 

It is also possible to produce libraries in which amino acids residues within desired 
segments of the major coat protein are varied to obtain a library of major coat protein variants 
having amino acid additions, substitutions or deletions within defined regions of the coat protein. 
As an example, the major coat protein may be divided into an arbitrary number of zones, generally 
2-10 zones, and a library constructed of variants within one or more of the zones. The mature 
major coat proteins of Ml 3, fl and fd phage, for example, contain 50 amino acids and might be 
divided into 10 zones of 5 amino acid residues each or into zones with unequal numbers of residues 
in each zone, e.g. zones containing 15, 10, 9, and 8 residues. Zones corresponding to the 
cytoplasmic, transmembrane and periplasmic regions of the coat protein may be used. A separate 
library may be constructed for each of the zones in which amino acid alterations are desired. If 
fusion proteins are desired in which the major coat protein variant has an amino acid alteration in 
zone 1 , for example, a single library may be constructed in which one or more of the amino acid 
residues within zone 1 is varied. Alternatively, one may wish to produce fusion proteins in which 
2 zones contain amino acid alterations. Two libraries, each library containing alterations within 
one of the 2 zones, can be prepared. 

The variant coat protein fusions will contain one or more alterations including 
substitutions, additions or deletions relative to the wild type coat protein sequence. Surprisingly, a 
large number of alterations are possible and are tolerated by the phage while retaining the ability to 
display polypeptides on the phage surface. Further, the chemical nature of the residue may be 
changed, i.e. a hydrophobic residue may be altered to a hydrophilic residue or vice versa. Variants 
containing 2 - 49, preferably 5 - 40, more preferably 7 - 20, altered residues are possible. As 
demonstrated by the construction of protein P12 below, any of the amino acids of a major coat 
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protein may be varied, including varying all residues ofthe coat protein. Fusion proteins 
containing any mature coat protein sequence or portion thereof which varies from the wdd type 
sequence of the coat protein or portion thereof is ^^of^^^ 
proteinvariantscontaining^.0, 6,7,8,9,10,11,12, 

50 variant residues are possible. Variants containing substitutions or only a few deletums are 
preferred since these variants will have about the same length as the wild type coat protem 
sequence. Variants which do not enable surface display ofthe heterologous polypept.de are 
selected against during the phage display, panning and selection process. 

10 The construction of such libraries for the M13 major coat protein is described in Example 

3- the selection of protein vm variants which increase the display of hGH or SAV is shown in 
Example 4. For library construction, the 50-residue protein VIE was divide* into five zones 
encompassing approximately ten contiguous residues each. A library was constructed for each 
Z one of the protein VIII moiety within the hGH- P rotein VIII fusion encoded by pS349. Most 

U positions were not fully mutated, but vanation was allowed at all non-lysine residue, Each 
library, encoding 10* possible protein VIH variants, contained at least 3 x l& independent 

transformants. U r»rn« 
The libraries were separately cycled through five rounds of binding selection on hGHbp- 
coated plates. Sequencing of individual clones revealed selectants from libraries encompassmg 
20 2 onesl,2,and3(Fig.2, THe zone 4 and zone 5 sorts yielded contaminants from other hbrane, 
me results suggest that zones 1, 2, and 3 are more tolerant to mutations and thus more smtab^ for 
use in the invention, but the results do not exclude the use of mutations in zones 4 and 5 for the 
purposes of the invention, and mutations in these zones are within the scope ofthe invenUoa 
Repeating the experiment with additional precautions to avoid contamination between hbranes wd. 
25 yieldvanantswimmut^ 
display. 

Selectants were extremely divergent from the wild type sequence, contammg seven 
.utationsonaverage. For zones 1 and 3, a strong consensus was not obtained. Tnree of four zone 
2 selectants were identical, but subsequent clones selected for streptavidin display (see below) 
30 yielded little consensus. Onlyone residue (Ala 10) was completely conserved as wild type, ,„ 
residues(Ala7,Leul4,Alal8 > Ile22,Met28,andVal30)showedconsensustowildtype. Eight 

positions showed consensus to a mutant sequence (E2K, D4E, P6F, OR, F11Y, G23R, A27T, and 

V29Y). 

All selectants increased hGH display; Figure 2 shows phage ELISA data for the best 
35 selectantfromeachzone. hGH display with the protein VIII variants produced detectable ELISA 

type protein VIE produced detectable ELISA signals with phage concentrations in the nanomolar 



WO 00/06717 



PCTAJS99/16596 



10 



15 



20 



25 



30 



range. Thus, the protein VIII variants increase the signal strength (and decrease the detection limit) 
by at least three orders of magnitude. 

Selection for SAV display was conducted with pooled libraries for zone 1, zone 2, and 
zone 3 of the protein VDI moiety within the SAV- protein VIII fusion gene. All selectants were 
from the zone 2 library, but consensus was minimal (Fig. IB). All selectants increased SAV 
display. Phage ELJSA data for the two best selectants is shown in Fig. 4; the variants provide a 50- 
fold increase in signal strength when assayed for binding to either anti-S AV antibody or to 
biotinylated BSA. 

In Example 8, hGH mutants with reduced binding affinity for hGHbp were displayed as 
fusions to either wild type protein VIII or variant protein VIII(la). Phage ELISA data are shown in 
Figure 3. As expected, reductions in binding affinity produce corresponding reductions in signal 
strength. When displayed on wild type protein VIII, the lowest affinity interaction (K<j = 820 nM, 
500-fold weaker than wild type) is barely detectable. The same interaction provides an extremely 
strong signal when displayed on the protein VTII variant. In fact, display of the lowest affinity 
interaction on the protein VIII variant provides an ELISA signal at least two orders of magnitude 
greater than that of wild type hGH linked to wild type protein Vm. 

Variants with alterations in more than one zone are possible as noted above. Such variants 
may be obtained by first obtaining a variant with mutations in a single zone (e.g., zone 1) and then 
using this variant as a template for a second round of selection in which another zone is mutated 
(e.g., zone 2). Thus, variants obtained by this process will have mutations in two zones (e.g., zones 
1 and 2). Alternatively, site-directed mutagenesis can be used to combine mutations from different 
variants into a single variant. For example, the mutations from a variant with mutations in zone 1 
can be introduced into a variant with mutations in zone 2 to produce a new variant with mutations 
in both zones 1 and 2. This process shown in Example 9. 

Obviously, both methods can be extended further to any number of zones which may 
eventually encompass the entire major coat protein sequence. Thus, it is possible to derive variants 
of the major coat protein with only minimal homology to the wild type. The potential for extreme 
variation in sequence in turn provides for extreme variation in function; variants ranging from 
extremely hyper-functional to extremely hypo- functional can be readily obtained. Thus, the 
invention can be used to tailor the display level of any heterologous protein fused to a coat protein. 
It is important to note that while the invention allows for the production of proteins with only 
limited homology to wild type coat protein, the new proteins are still variants of wild type protein 
(see definitions above) and are thus within the scope of this invention. 

If a variant major coat protein which improves display of a heterologous polypeptide on 
the surface of phage particles contains multiple mutations relative to wild type, it is also possible to 
obtain variants which display the heterologous polypeptide at levels intermediate between the 
levels obtained with the new variant and wild type major coat protein. This can be accomplished 
by separately back mutating each mutated amino acid of the variant back to the wild type sequence 
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5 This process is shown in Example 10. 

By ,s inito pr<>™ S s,i,»po SsM e„ob,™va ri 3 B ,s*h i chdisp li ,y M s l e™ 1 below tt « 

M of*, wild «. coat P».ein. For — * -» * — ta <>, ' e " °7 7" * 

,„ pratei. tenon* andean be isolated and fenced using town mated*. 
M „ 1M coa,p,o,ei«c.n,*ob,ai».dw m cb^ 
viial coat and thus reduce tenon preaein display tetenve .0 wild type coa, protein. M to » 
muB *» are ntade in rosiduc* which tend to be conserve ,s wild type in the above , *— - 
"Lions fochypntfuncton^ « (eg. AlalO, Ala7, L»14, M.1S. I,e22, Mem, and Vad30 . 

u rjvm,. L.rc.do.of^.rnsidu.eswiidcypedudngtteselectionforb^.ett.na, 

,evols Tbe heterologous peptide pctio. of Ute fusion protein nt.y be ,., nolypepnde o, proten 
2Q Itla.bncnpnn.docbo.dbyabi.dingp^. Sunnble nasion, -ay display »n «p„op« cag 
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proteins displayed in greater numbers on the virus particles will be preferentially bound by the 
target molecule. Separation of particles which bind to the target from those that do not bind 
enables one to obtain an enriched library of particles containing fusion proteins which display 
greater numbers of the heterologous protein on the surface of the particles. This panning process 
can be repeated multiple times, generally from 2-10, preferably 2 - 6, times to obtain further 
enriched libraries containing clones which contain fusion genes encoding fusion proteins of the 
heterologous polypeptide linked to the selected variants of the major coat protein, the variants 
which allow improved incorporation of the fusion protein into the virus coat in the phage display 
system. This method allows one to select major coat protein variants which are best able to display 
the particular heterologous polypeptide and optionally other polypeptides as a fusion protein on the 
surface of the phage particle. 

The clones selected by the process described above will have an improved capacity to 
display the heterologous protein on the surface of the phage in the phage display system. The 
method is, therefore, generally useful in displaying heterologous polypeptides which are difficult to 
display using fusion proteins based on wild type coat protein amino acid sequence. The improved 
fusion protein may be used in monovalent and/or polyvalent phage display systems. In polyvalent 
systems, the improved fusion protein can be used to increase the number of fusion proteins which 
are expressed on phage particles or to modulate the number of displayed proteins to a desired 
range. Phage particles displaying a greater number of fusion proteins will have a greater affinity 
for target molecules and will be preferentially bound and separated from particles which display 
fewer heterologous polypeptides as fusion proteins. This is useful in processes for the discovery of 
weakly binding ligands where the binding affinity of the ligand will be increased by known protein 
engineering techniques, such as the maturing of antibodies or antibody fragments by humanization, 
etc. In polyvalent display, the fusion proteins of the invention allow display of only a few to 
several hundreds of polypeptides on the phage surface. Generally, about 3 to about 50 
polypeptides will be displayed in a moderate display system. However, with the fusion protein and 
phage display system of the invention it is also possible to display about 50, preferably 100 - 900, 
and up to about 1000 polypeptides or more by selecting a coat protein variant through phage 
display which is capable of high display numbers. 

Vaccination techniques based upon phage expressing antigenic proteins fused to the coat 
protein have been described (Fanutti, C, et al. (1998) Biochem. Soc. Trans., 26: S8; Jiang, J., et al. 
(1997) Infect. Immun., 65:4770; Delmastro, P., et al. (1997) Vaccine, 15:1276; Galfre, G., et al. 
(1996) Methods Enzymol., 261: 109). This invention can also be used to enhance the effectiveness 
of phage vaccinations. The variants of the coat protein which increase expression of protein 
fusions on the surface of phage increase the antigenicity of phage vaccinations. Furthermore, the 
method can be used to generate variants of the coat protein which stimulate the immune system as 
haptens. Alternatively, the invention can be used to ameliorate immune response to the phage 
carrying the antigenic protein. 
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Alternatively, in monovalent display systems the method and fusion proteins of the 
invention can be used to preeisely tailor display of a protem to a level which is hi^ enough to 
allow for detection and enrichment of desired affinities, but is low enough to avoul avuuty effects 
associatedwithpolyvalency. v^^WV*^-^*™™*"' 

protein variant Proteins which do not display at all on wild type major coat protem (,e., cannot be 
detected as phage-associated entities) can be displayed monovalent* as fusions to an appropriate 
hyper-functional major coat protein variant. 

Having obtained a variant major coat protein which unproves/tailors display of the 
» heterologous polypept.de on the surface of phage partic.es, it is then possible to use conventions, 
phage display technologies to construct libraries of variants of the originally displayed 

heterologouspolype^^ 
Thefcsionproteinoftheinven^^ 

phage which provides the desired display characteristics, can be used to replace a fusion protem m 

15 conventionalphagedisp^^ 

a^oacidsequenceofthemajorcoatproteinorcoatproteinfragment. Replacement of he 

conventional fusion protein with the variant fusion protem of the invention improves the d,sp.ay 
heterologous polypeptide in the phage display system. That is, the coat protein portton has been 
optimized for the polypeptide which is displayed as a fusion protem. 

F urther,itispo SS ^ . 
protein obtained as described above with a second different heterologous polypeptide and mamt^n 

L benefits of improved incorporation of the fusion protein into virus particles. See Example 7. In 

tMs aspect of the invention, the fusion gene which encodes the original polypeptid^or coat 

protein variant fusion obtained by phage display panning and selection as discussed above rs 

25 .odifiedtoroplacemegeneencodmgmefn.tpolypeptidew.m 

peptide of interest. Conventional phage display Ubraries can the. be constructed m winch one 
or more residues of the second polypeptide are varied and selected by phage display panmng and 
selection to obtain variants oftheseeond polypeptide with the desired (e.g., improved) bmdmg 
properties. This result is also surprising since the variant coat protein portion of the fuston protem 

30 W asorigmally selected for the ab^ . 

Nevertheless, k ^^^^f^^^^^^^^ 
whichhasbeen selected forimproveddisplay of a heterologous polypeptide will also generally 

provide improved display of other unrelated heterologous polypeptides as well, even other 
polypeptides containing multiple subunits. The fusion protein containing the variant coat protem 
35 portion can, therefore, be used in general phage display systems. 
Thephagedisplaysystemofthem^^ 
areproducedasthempeuticpolypeptidesusing conventional recombinant DNA technology. In thrs 
embodiment, themethod described above isusedto identify a fusion protein contatmng a desued 
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coat protein variant portion for use in phage display. The heterologous polypeptide portion of the 
fusion protein may be the desired product polypeptide itself or may be a different polypeptide 
where this phage display step is used to select for the coat protein variant portion providing 
improved surface display as described above. Using a gene for the selected coat protein variant 
5 portion in the fusion gene together with a gene for a heterologous polypeptide, one can then use 
phage display to optimize and select from a library of different potential product heterologous 
polypeptide sequences to obtain a product polypeptide sequence. This product polypeptide 
sequence is then cloned into an expression plasmid containing a transcription regulatory element 
operably linked to a gene fusion encoding the product polypeptide. Expression of the gene fusion 
10 in mammalian or bacterial cells yields the product polypeptide using well known recombinant 
technology. See Sambrook et ah 

It is also within the scope of the invention to prepare fusion proteins of a heterologous 
polypeptide and a portion of a phage coat protein, which is not necessarily the major coat protein 
of the phage, used to display the polypeptide. In this embodiment, for example, a minor coat 
15 protein such as coat protein III of a filamentous phage is mutated to form families and libraries of 
fusion proteins and phage variants as described above and phage display selection and panning are 
used to obtain specific phage displaying a fusion protein of a heterologous polypeptide and at least 
a portion of a coat protein variant. Preferably, the coat protein portion is a mutant having at least 
one altered residue in the transmembrane domain or in the cytoplasmic domain of the coat protein. 
20 With respect to coat protein III, these altered residues will preferably be in the region of residues 
377 to 406 as counted from the amino terminal end of the mature coat protein III (Marvin, D. A., 
Filamentous phage structure, infection, and assembly, Current Opinion in Structural Biology, 
1998, 8: 150-158). Coat protein III variants may contain a plurality of variant residues as generally 
described above for major coat proteins. 
25 Suitable gene III vectors for display of polypeptides include fUSE5 (Scott, J. K. and Smith 

G. P. (1990). Searching for peptide ligands with an epitope library. Science 249, 386-390); fAFFl 
(Cwirla., S. E., Peters, E. A., Barrett, R. W., and Dower, W. J. (1990). Peptides of phage: A vast 
display library of peptides for identifying ligands. Proc. Natl. Acad. ScL U.S.A. 87, 6378-6382); 
fd-CATl (McCafferty, J, Griffiths, A., D., Winter, G., and Chiswell, D., J. (1990). Phage 
30 antibodies: Filamentous phage displaying antibody variable domains. Nature {London) 348, 552- 
554); m663 (Fowlkes, D., Adams, M., Fowler, V., and Kay, B. (1992). Mutlipurpose vectors for 
peptide expression on the M13 viral surface. Biotechniques 13, 422-427); fdtetDOG, pHENl 
(Hoogenboom, H., Griffiths, A., Johnson, K., Chisswell, D., Hudson, P., and Winter, G. (1991). 
Multi-subunit proteins on the surfaces of filamentous phage: Methodologies for displaying 
35 antibody (Fab) heavy and light chains. Nucleic Acids Res. 19:41 33-4 1 37); pComb3 (Gram, H., 
Marconi, L. A., Barbas, C. R, Collet, T. A., Lerner, R. A., and Kang, A. S. (1992) In vitro 
selection and affinity maturation of antibodies from a naive combinatorial immunoglobulin library. 
Proc. Natl. Acad. Sci. U.S.A. 89, 3576-3580); pCANTAB 5E (Pharmacia); and LamdaSurfZap 
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(HO**, H. H, Amberg, J. R, Hay, B. N, Sorge. J. A., and Shopes. B. (.993, Co*, in . 

Gm '"h^e'dU — * p^, P^s and — van- *»* «*. 

opCTabl , Unbed . . nasion .needing a «on po.yp.pdde, n— a— a M «■* 
Inning *. tnnafonned ceUs .o fonn phago paano.es which diaplay h P<dypeP« « <- «" 

a, _ a portion of ,h. p»,ic,e bind ,n ,bo .age., aepaming ft. pa**. «b,cb M l. 
, ft*, nan, do no, b W a« known and „ay bo aaeo W.U. .bo -fed of .bo «~ S« U* 
5 ,50 373- WO 97/09446. U.S. 5,514,548; U.S. 5,498,538; U.S. 5,516,637; U.S. 5,432,018, WO 
« 2 W U.S. 5.65S.727; U.S. 5,677,024; WO 97,29185; OTJoyle 01 „, .997, 

La- 4,; S— >» «. *. .994, *< — *— - «" 5 -"« °*» ^ 
* 5 C„. ft* *- «. Mohowshi, .993. 128:5.,,; Oon, 1 96 

Lbuay and Cananeo, ,995, 77*9, ^242-249; Cocao « a,., ,995, 0~. *« 
mJ^AD. « *. 1996, UK 20:509-516; Lind,.,, - Nadeoi, ,995, FBMS 
,7:33.39; Caabaon and WoUa, ,994, 22=37,-284; Baobaa, ,993, CW. C3p,, *» 4.5*. 

550; McGregor, ,996, Uol BloucH., 6:155-,6* Cnneae « „. ,996, Cnec. Op,, M. 7.6,6-621, 
20 McLaffertyetal., 1993, Gene, 128:29-36. 

£ Mpi POlTT^d. portion of «be fnsion paeaei. may oonlan, » few »a «0 oa 
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The heaenamgoua po.ypepdde ponion noay abao b. a poo* paefeaaWy . 
^ sneh ea . cyaoblne, and 4 e p„,on, »»y he se,eo,en ta. hnn^ gnowd, ho^.ne^H,, 
LeMony, ta- powtb h^one, bo*o *»wdn ho m o TO , pa^yan.d b« ftyenx^, 
35 ,«* A l», -in B^nln, pacing aobn* A-ehain, ae,»i» M« M 
^n bo^onea »oh »a MUdo s*n,ann 8 bo m »ne,FSH,, .by^.d — 
bloneCTSH), MM. ta— (Mfl. *-^-*« »— «*» ~ 
faclor vn,, .. andbndy, ,n» 8 ta » — «" 

51 



WO 00/06717 



PCT/US99/16596 



activator (t-PA), bombesin, coagulation cascade factors including factor VII, factor IX, and factor 
X, thrombin, hemopoietic growth factor, tumor necrosis factor-alpha and -beta, enkephalinase, 
human serum albumin, mullerian-inhibiting substance, mouse gonadotropin-associated peptide, a 
microbial protein, such as betalactamase, tissue factor protein, inhibin, activin, vascular endothelial 
5 growth factor (VEGF), receptors for hormones or growth factors; integrin, thrombopoietin (TPO), 
protein A or D, rheumatoid factors, nerve growth factors such as NGF- alpha, platelet-growth 
factor, transforming growth factors (TGF) such as TGF-alpha and TGF-beta, insulin-like growth 
factor-I and -II, insulin-like growth factor binding proteins, CD-4, DNase, latency associated 
peptide, erythropoietin (EPO), osteoinductive factors, interferons such as interferon-alpha, ,- 

10 alphacon-1, -beta, and -gamma, colony stimulating factors (CSFs) such as M-CSF, GM-CSF, and 
G-CSF, interleukins (ILs) such as IL-1, IL-2, IL-3, IL-4, IL-6, IL-8, IL-10, IL-12, superoxide 
dismutase; decay accelerating factor, viral antigen, HIV envelope proteins such as GP120, GP140, 
atrial natriuretic peptides A, B or C, Apo2L, novel erythropoiesis stimulating protein (NESP), 
ancestim, keratinocyte growth factor (KGF), brain-derived neurotrophic factor (BDNF), glial cell 

15 line-derived neurotrophic factor (GDNF), leptin, IL-1 receptor antagonist (IL-lra), soluble tumor 
necrosis factor-a receptor type I ( sTNF-RI), immunoglobulins, as well as variants and fragments 
of any of the above-listed proteins 

The heterologous polypeptide portion may also include a molecular tag, also known as an 
epitope tag, for identifying and/or capturing and purifying the fusion protein. For example, the tag 

20 may be Herpes simplex virus glycoprotein D (Paborsky et al. 1990, Protein Engineering, 3:547- 
553) which can be used to affinity purify the fusion protein through binding to an anti-gD antibody, 
protein A or a fragment thereof (Li et al. (1998) Mol Biotech., 9:187), a polyhistidine tag, e.g., 
(his) 6 (Sporeno et al., 1994, / Biol Chem., 269:10991-10995; Stuber et al., 1990, Immunol. 
Methods, 4: 121-152, Waeber et al., 1993, FEES Letters, 324: 109-1 12), etc., which can be used to 

25 identify and/or purify the fusion protein through binding to a metal ion (Ni) column 
(QIAEXPRESS Ni-NTA protein Purification System, Quiagen, Inc.). Other affinity tags known in 
the art may be used. 

In a particularly preferred embodiment, the heterologous polypeptide portion of the fusion 
protein is the light chain or the heavy chain of an antibody or fragments thereof, such Fab, F(ab') 2 , 
30 Fv, diabodies, linear antibodies, etc. The polypeptide may also be a single chain antibody (scFv). 
The preparation of libraries of antibodies or fragments thereof is well known in the art and any of 
the known methods may be used to construct a family of transformation vectors which may be 
transformed into host cells using the method and fusion protein of the invention. Libraries of 
antibody light and heavy chains in phage (Huse et al., 1989, Science, 246:1275) and as fusion 



35 proteins in phage or phagemid are well known and can be prepared according to known 
procedures. See Vaughan et al., Barbas et al., Marks et al., Hoogenboom et al., Griffiths et al., de 
Kruif et al., noted above, and WO 98/05344; WO 98/15833; WO 97/47314; WO 97/44491; WO 
97/35196; WO 95/34648; U.S. 5,712.089; U.S. 5,702,892; U.S. 5,427,908; U.S. 5,403,484; U.S. 
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54320,8; U.S. 5,270.170; WO 02,06176; U.S. 5,702,802. Review, have aho published. 
ZjL. .007, M .5:62,0 ; Net, . ... .005, C* ^ 

Jefferies, 1998, Parayito/ogy, 14:202-206. 

Specific untibodios contemplated as .h. heterologous polypopbde porta, include 
addles wbieh bind .o I— leubooyte surface - 
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^«^*cD^e». tt o*.*e»..T.e».ae»i^««»w,a»«. 

CDwl24, CD126-CD129, and CD130. 

0f.ee embody binding fargcu include oytokines and cytokine superfbmil, teccptots. 
^aopofe* grow* tacto, superfh-y raptors and preferably .he 
*_« which ace , group of elosely rel.fed glyeopcei. eel surface ,«eep«s « *- 
adorable homology uacludipg » a WSXWS dome* and ore g»» * *-* - 
mmtos of Ihe cytokine reo^Pa, srperfnmi.y laee e.g. Nicola « * M 67.M « - 
,„ ^d. RC».I flfflO J- 12:2645-2653 (1993)). Oonmlly, these targets are receptors fo, 

io r^xio^—g^ccsp,. 

not taired to. reeoprors (or: U <b and g ohaips) <HaP*.yan» - * 5 " 

,,980); Takeshi- a, .2. S*~. 257:370-382 ,100,)), M (Boh a, of., We. 247.324-32 

xr i w j rrej K7S459-5463 (1990); Kitamura e/ al, Cell, 
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W Sc. OH. 88:8641-g« 5 (.001), and *. reooptoPS fo, ppolaclin OJ-d. - . ^ « 
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(1989) ), growth hormone (GH) (Leung et al, Nature, 330:537-543 (1987)), ciliary neurotrophic 
factor (CNTF) (Davis et al, Science, 253:59-63 (1991) and c-Mpl (M. Souyri et al, Cell 63:1137 

(1990) ; L Vigon et al, Proc. Natl Acad. Sci. 89:5640 (1992)). Still other targets for antibodies 
made by the invention are erb2, erb3, erb4, IL-10, IL-12, IL-13, IL-15, tumor necrosis factor alpha, 

5 thrombin, etc. The variant coat protein fusions and the variant heterologous polypeptides and 
libraries containing the same can be prepared using conventional mutagenesis techniques. These 
methods include but are not limited to oligonucleotide-mediated mutagenesis and cassette 
mutagenesis. 



10 a peptide linker. A linker peptide segment will generally vary in length from about 3 to about 50 
amino acid residues, preferably from 5 to 30 residues, more preferably from 10 to 25 residues. 
Further, the net charge on the linker segment is preferably positive. The identity of and order of 
the amino acid residues is optional, although one or more specific sequences of the linker peptide 
segment will generally provide better display of the heterologous polypeptide. The method of this 

15 invention can also be used to modulate display levels of a fusion protein by mutating the linker 
between the fused protein and the coat protein and selecting linkers which afford the desired level 
of display. In this embodiment, a library of linker segment variants is made by mutating a linker 
sequence template and the linker sequences which give the best display on phage are selected using 
phage display selection, for example, an affinity selection for binding to the displayed heterologous 

20 polypeptide. Linkers which allow for greater numbers of displayed polypeptides will be selected 
based on increased affinity for the affinity matrix. 

To date, researchers have used specific linkers designed to provide desired attributes. 
Linkers have been designed to provide flexibility (Wung et al. (1997) J. Immunol Methods 204:33- 
41), such as the Gly-Ala3 (Holmes et al. (1996) Protein Pept. Lett. 3:415-422) or Gly4Ser3 linkers 

25 (Micheal et al. (1996) Immunotechnology 2:47-57) and to incorporate sites for specific proteolysis 
(Lucie, et al. (1998) Australia. J. Biotechnol 61:95-108; Matthews, D.J. and Wells, J. A. (1993) 
Science, 26:1113-1117). Considerations for linker optimization include, among other factors, 
resistance to proteolysis, distance from the phage particle to the fused protein, and conformational 
effects of the linker upon fusion protein activity. The large number of variables involved makes 

30 the selection method of the invention an attractive approach. For example, selection of a linker for 
increased display of hGH on protein VIII (Fig. 7A) results in increased display relative to a 
designed Gly/Ser linker (Fig. 8A). A similar selection was also used to select linkers (Fig. 7B) for 
SAV display (Fig. 8B). In the SAV display, the selection was performed with SAV fused to a 
protein Vm variant (protein VIII(le)) which had been previously selected for increased display of 

35 SAV (Example 4). This example demonstrates that optimized linkers can be combined with 
optimized coat protein variants, for example protein VIII variants, to obtain a desired display level, 
in a manner similar to the combination of different protein VIII variants described above. 



The heterologous polypeptide may be linked to the coat protein or portion thereof through 
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The methods described above with reference to obtaining a gene encoding a heterologous 
polypeptide, variant polypeptide or a fusion protein containing at least a portion of a phage coat 
protein and a heterologous peptide or variant and isolating the same, oligonucleotide-mediated and 
cassette mutagenesis, cleaving DNA using restriction enzymes, ligation, separation and selection of 
5 DNA using electrophoresis, purification and transformation (e.g., electroporation) procedures, 
library construction, suitable host or recipient cells and cell concentrations, etc. and the pnor art 
me thods noted above may be used in this embodiment of the invention and the description thereof 
is incorporated here specifically with respect to this embodiment. 
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c Carboy trrmirr' jfe and morengw phage coat proteins and fusions thereof 

Another aspect of the invention is the carboxyl-tenninal (C-terminal) display of a 
heterologous polypeptide on the surface of a filamentous phage using protein fusions with protein 
III or protein Vffl. C-terminal display has been reported on protein VI of M13 (Jespers, L. et aL, 
1995 Biotechnology 13:378-382). This paper states that protein VI is distinct from proteins ffl 
and VIH in its ability to allow for the attachment of polypeptides at the C-terminus. Surprisingly, 
as a part of this invention, it has been discovered that C-terminal display is also possible with 
fusions to protein m and VIH. The invention, therefore, allows the C-terminal display of a 
heterologous polypeptide or library of polypeptides in a manner similar to display at the N- 
terminus (N-terminal display) of a phage coat protein. In this aspect of the invention, the C- 
terminal display may be accomplished using a wild type protein HWIII or a mutant protein 
ffl/VHI as described above where the phage display process was applied to the coat protein 
sequence itself. 

Any of the methods of phage or phagemid display, creating coat protein variants and 
protein fusions thereof with a heterologous polypeptide, libraries of such variants and fus.on 
proteins, expression vectors encoding the variants and protein fusions, libraries of the vectors, a 
library of host cells containing the vectors, methods for preparing and panning the same to obtain 
binding polypeptides, etc. described above with reference to N-terminal display may also be used 
in this aspect of the invention for C-terminal display and the descriptions above are hereby 
incorporated here and should be considered as part of the description of C-terminal display of the 
30 invention. 

The invention allows one to evolve new virus particles containing non-w.ld type coat 

proteins and coat protein fusions. 

The variant protein m/VHI fusion proteins will contain one or more alterations including 
substitutions, additions or deletions relative to the wild type coat protein sequence. Again, it is 
35 surprisingly, that a large number of alterations are possible emd are tolerated by the phage while 

retaining the ability to display polypeptides on the phage surface, in this case as C-terminal fusions. 
The chemical nature of the residue may be changed, i.e. a hydrophobic residue may be altered to a 
hydrophilic residue or vice vers, Variants containing 2 - 50, preferably 5 - 40, more preferably 7 - 
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20, altered residues are possible. Fusion proteins containing any mature coat protein sequence or 
portion thereof which varies from the wild type sequence of the coat protein or portion thereof is 
within the scope of the invention. Major coat protein variants containing 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 
37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49 or 50 variant residues are possible. This aspect of 
the invention allows one to design a coat protein which is any coat protein other that the wild type 
coat protein and select for C-terminal fusion proteins which display on the surface of phage. 
Variants containing substitutions are preferred since these variants will have about the same length 
as the wild type coat protein sequence. However, deletions of residues to prepare shorter coat 
proteins and protein fusions thereof are within the scope of the invention. Preferably, the first few 
residues will be deleted, more preferably N-terminal or C-terminal residues 1 to about 5 can be 
deleted. Variants which do not enable surface display of the heterologous polypeptide are selected 
against during the phage display, panning and selection process. 

As with N-terminal display described above, it is also possible to produce libraries in 
which amino acids residues within desired segments of the coat protein are varied to obtain a 
library of coat protein variants having amino acid additions, substitutions or deletions within 
defined regions of the coat protein. As an example, the coat protein may be divided into an 
arbitrary number of zones, generally 2-10 zones, and a library constructed of variants within one or 
more of the zones. The mature coat proteins of Ml 3, fl and fd phage, for example, contain 50 
amino acids and might be divided into 10 zones of 5 amino acid residues each or into zones with 
unequal numbers of residues in each zone, e.g. zones containing 15, 10, 9, and 8 residues. Zones 
corresponding to the cytoplasmic, transmembrane and periplasmic regions of the coat protein may 
be used. A separate library may be constructed for each of the zones in which amino acid 
alterations are desired. If fusion proteins are desired in which the coat protein variant has an amino 
acid alteration in zone 1 , for example, a single library may be constructed in which one or more of 
the amino acid residues within zone 1 is varied. Alternatively, one may wish to produce fusion 
proteins in which 2 zones contain amino acid alterations. Two libraries, each library containing 
alterations within one of the 2 zones, can be prepared. 

Preferably, the heterologous polypeptide is attached to the coat protein or variant thereof 
through a linker peptide. The linker may contain any number of residues which allow C-terminal 
display, and will generally contain about 4 to about 30, preferably about 8 to about 20, amino acid 
residues. The linker may contain any of the naturally occurring residues, although linkers 
containing predominantly (greater than 50%) glycine and/or serine are preferred. The optimum 
linker composition and length for display of a particular polypeptide may be selected using phage 
display as described above and demonstrated in the examples. For example, phage libraries each 
containing a different linker length may be constructed and phage selection and panning used to 
isolate the amino acid composition of the linker of any length which optimizes expression and 
display of the heterologous polypeptide. 
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As with N-tenninal display described above, if a variant coat protein which improves 
display of a heterologous polypeptide on the surface of phage particles contains multiple mutates 

polypeptide at levels intermediate between the levels obtained with the new variant and wud type 
5 coatprotein. Ibis can be accomplished by separately bade mutating each mutated ammo acrd of 

generally reduce display levels of the heterologous polypeptide to levels varying between d^lay 
levels obtained with the variant and wild type coat protein. By combining the back mutatmns U ,s 
possible to tailor display to a desired level which is between that obtained with the variant and 

10 wild type coat protein. 

By a similar process, it is possible to obtain variants which display at a level below the 
,evel of the wild type coat protein. For example, mutations may be made in one or more zones and 
thelibrariesproducedpannedforphagewbichbmdonlywealdyCweaker^ 
wild type fusions). The weaker binding phage will be displaced by phage displaying wad type coat 
,5 protemfusionsandcanbeisolatedandsequencedusingknownmethods. 

Mutant coat proteins can also be obtained which are hypofunctional for incorporate mto 
the viral coat and thus reduce fusion protein display relative to wild type coat proteu, In this case, 
stations aremadeinresidues which tend to be conservedas wild type in the above desenbed 
selections for hyperfunctional variants. Conservation of these residues as wild type dunng the 
20 selection for hyperfunctional variants indicates that mutations at these residues are not well 

Variantsobtainedthroughmutauonsat 

these sites can then be screened for their ability to display a given fusion protein relaUve to the 
wild type coat protein display levels. Hypofunctional variants displaying the fusion at the desrred 
.educed levels relative to wild ty^ecanthen be used for the construction oflibrarieso the fusron 
25 proteinforthepurposesofphagedisplay. Although the preferred residues for the pro ucte of 
hypofunctional variants are those which were conserved as wild type, any residue of the coat 
protein can be mutated and the resulting variant tested for its ability to allow display of a fuse 
protein. In this way, it is possible to select a display level below that afforded by wild type simply 
by using the appropriate hypofunctional mutant. As with the hyperfunctional variants desenbe 

30 above, S evemlhypomnc^ 

to levels desired. While, the selection of hypofunctional variants requires a screen rather than a 

selecrion,memethodisrela^ 

activity rather increases and suitable screening procedures are known. Thus, most mutates m the 
coat protein should be deleterious mutations which result in hypofunctional vanants. 
35 C-terminal display is useful to display cDNA libraries on the surface of phage parucles. 

mRNA can be purified from a tissue source of choice and double stranded cDNAs synthesized 
using standard techniques (Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2nd 
Ed. Cold Spring Harbor, New York). A phagemid I or phage vector (or a plurality thereof) 
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containing an open reading frame is then constructed using well established techniques disclosed in 
Sambrook et al. and the phage and phagemid display references described above and the cDNAs 
are ligated into the vectors at the 3' end of the coat protein gene. Host cells are then transformed, 
preferably by electroporation, with the library of vectors and phage particles displaying 
heterologous polypeptides corresponding to the cDNA library members are obtained (with 
superinfection of helper phage for phagemid vectors). The C-terminal phage display library 
obtained may be panned and analyzed using conventional phage display techniques. 

The C-terminal display of the invention is also useful to display intracellular, preferably 
mammalian intracellular, proteins or fragments thereof and polypeptides which are difficult to 
display using N-terminal display. C-terminal display is, therefore, a complementary display 
technique to N-terminal display. Intracellular proteins may be difficult to display in a correctly 
folded form using N-terminal display due to the difference in redox environment in which 
intracellular proteins normally exist relative to the environment in which secreted proteins fold and 
form disulfide bonds. The cytoplasm is a reducing environment whereas the periplasm is an 
oxidizing environment. C-terminal heterologous fusion proteins migrate to the periplasm as in 
normal phage particle assembly. However, since the heterologous polypeptide remains on the 
intracellular side of the periplasmic membrane, an intracellular polypeptide may correctly fold 
prior to incorporation into a phage particle. During assembly of the phage or phagemid particle, 
the C-terminal fusion protein is incorporated into the particle and displays the heterologous 
polypeptide on the surface thereof. 

C-terminal display bypasses secretion problems encountered with N-terminal display 
systems. With N-terminal display, it is generally thought that the heterologous polypeptide on the 
N-terminus must pass through a pore-like structure in the periplasmic membrane in order to enter 
the periplasmic space with the C-terminus remaining as an anchor in the membrane. The fusion 
protein is then assembled into a phage particle from the membrane. Using C-terminal display, it is 
not necessary to have the fusion protein secrete into the host cell periplasm in order to assemble 
phage particles. C-terminal display is, therefore, useful to display any heterologous polypeptide 
and is particularly useful to display polypeptides which are difficult to display using N-terminal 
phage display techniques. 

C-terminal display can also be used to display a foreign polypeptide which is secreted into 
the periplasm during virus particle assembly. For example, by constructing a library of potential 
membrane proteins and selecting members of the library capable of functioning as coat proteins, it 
is possible to apply selective pressure to the library and evolve coat proteins which have a foreign 
polypeptide as a C-terminal fusion and which orient in the cell membrane with the foreign 
polypeptide in the periplasm and the N-terminus of the fusion protein in the cytoplasm. Such 
fusion proteins preferably have a positively charged N-terminal portion as a cytoplasmic region 
and a hydrophobic core portion as a transmembrane region. Such a structure resembles a bacterial 
secretion signal. Some library members will function as secretion signals and insert into the 
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bacterial membrane with the N-terminus in the cytoplasm and the C-terminus in the periplasm 
(inverse coat proterns). Some of the fusions which can insert into the membrane will incorporate 
into assembling virus particles by virtue of favorable interactions with the phage or phagennd coat. 
Suitable libraries may be designed in multiple stages. For example, inverse coat protems may be 

5 selected from a library (or libraries) of potential coat proteins by using an epitope tag fused to the 
C-terminus of the library of coat proteins. After phage particle assembly, antibody bindmg to the 
epitope tag is used to isolate members of the vims particle library which display the tag on the 
surface of the particle. The particles which bind can be isolated/selected and cloned using 
conventional phage display techniques. In a second step, one of the selectants can be further 

,0 evolved by phage display to select for improved incorporation into a particle coat. Again, one or 
more libraries can be constructed to vary different regions of the coat protein to select for those 
proteins which best display the epitope tag or other protein on the surface of the particle. Coat 
protein fusions and virus particles prepared according to the invention provide a diversity of tools 
which are useful to evaluate virus structure and assembly processes, to map the antibody bmdmg 

, 5 epitopes on proteins, to affinity mature binding proteins such as antibodies and fragments thereof 
to provide binding proteins with higher binding affinities, to produce polypeptides which bmd to 
active and allosteric sites on enzymes, etc. including all of the uses for which phage display is 
currently employed. 

All patent and literature references cited above are incorporated herein by reference in their 

20 entirety. 

EXAMPLES 



25 



30 
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Exam ple 1 - Construction of E cgli SS320 

The new cell line SS320 was prepared by bacterial mating in which the F episome was 
transferred from XLl-BLUE cells to MC1061cells according to known protocols (J. H. Miller, 
1972, Experiments in Molecular Biology, p!90). More specifically, the SS320 cells can be 

obtained using the following steps: 

-Grow 1.0 mL cultures of MC1061 and XLl-BLUE in LB broth to OD600 = 0.5 (single 
colonies from freshly streaked plates. MC1061 was streaked on LB plates. XLl-BLUE was 
streaked on LB/tetracycline (lOng/mL)). 

-Mix 0.5 mL of each culture and grow 1 hour at 37°C with slow shaking (50 rpm on a 
rotary shaker). After mating for 1 hour, agitate at 250 rpm to disrupt the mating. 
-Plate dilutions on LB/tetracyclme(10pg/mL)/streptomycin(10pg/mL). 
-MC1061 carries a streptomycin resistance chromosomal marker while the P plasmid of 
XLl-BLUE confers tetracycline resistance. Thus, only the mating progeny (MC1061 harboring 
the XLl-BLUE P episome will be resistant to both antibiotics. The unmixed cultures of MC1061 



WO 00/06717 



PCT/US99/16596 



and XL 1 -BLUE can be plated on the selective media as controls, since neither parent is doubly 
resistant. 

-The resulting strain (SS320) can be used for electroporation and phage production. 
The genotypes of the starting MCI 061 cells ( available from Bio-Rad Laboratories, Inc.) 
5 and XL1-BLUE episome (available from Stratagene, Inc. in XL1-BLUE cells) and the resulting 
SS320 cells are as follows: 

XL 1 -BLUE F episome 

F::TnlO proA+B + lacND(lacZ)M15 

10 MC1061 

F-araD139D(ara-leu) 7696galE15galKl 6D(lac)X74rpsL(StT^hsdR2(i^m\^)mcrAmcrBl 

SS320 

F::TnlO proA+B+lad<JD(lacZ)M15/ 
15 V-araD139D(ara4eu)7696galE15galKJ6D(lac)X74rpsL{S^ 

Various £. coli strains were evaluated for cell survival and viability after standard washing 
steps were performed to prepare the cells for electroporation. E. coli were grown in 250 mL 
cultures and prepared for electroporation as previously described. The total number of colony 
20 forming units were titered before and after the wash procedure and the results are shown below. 

Before wash After wash Survival 



Strain (cru) (cfu) (After/Before) 

SS320 4.8x10*2 4.6 xlO* 2 096 

TBI 1.7x10" 1.9x10" 11 

JM101 4.5x10" 3.6x10*0 0.08 

JM107 2.3x 10" 1.0 xlO* 0 0045 

JM109 1.5x10" 1.2x1010 0.08 



The transformation yield achieved with different E. coli strains at a fixed concentration of 
DNA is dependent on the concentration of viable E. coli cells in the electroporation reaction. It has 
25 now been discovered that the maximum concentration of viable cells which can be achieved with a 
given strain is dependent upon the resistance of that strain to the washing steps involved in the 
preparation of electrocompetent cells. The suitability of a particular bacterial (e.g., E, coli) strain 
for electroporation can be easily determined using the following procedure. 

Grow a 250 mL culture of bacteria to OD600 = 0.6 in a 1-L baffled flask. Remove a small 
30 aliquot and plate dilutions on appropriate media to determine the total number of viable cells; this 
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number is the input of cells (input, I). Follow the standard procedures for production of 
electrocompetent cells as described in Example 2 (scaled down appropriately for the volume of 
culture used). Plate dilutions of the final preparation of electrocompetent cells on appropna e 
m edia to determine the total number of viable cells; this is the number of ells surv.vmg the 

5 electrocompetent cell preparation procedure (survivors, S). Divide S by I to determine the rat,o of 
survwors to input (S/l). For a strain ideally suited for electroporation (i.e. a strain which gives the 
highest transformation yield in comparison with other strains at a fixed DNA concentrate), the 
ratioS/I should be equal to one. This indicates that all the input cells survived the electrocompetent 
cell preparation procedure. A reduction in the ratio S/l corresponds to a reduction m the 

,0 concentration of viable cells in the electrocompetent cell preparation; this in turn results m a 
reduction in the transformation yield. Thus, for the highest transformation yields at a given DNA 
concentration, a strain with the highest S/I value should be used. 

Fr o™ r i» -> - Pre paratio n nf F. ™li for electroporation 
15 ~" Electroporation competent cells were prepared as described below: 

1 Inoculate 1 mL 2xYT media(5 mg/mL tetracycline) with SS-320 from a fresh LB/tet 
p,ate. Grow about 6 hours and inoculate 50 mL 2xYT/tetracyline in a 500-mL flask; grow 

0VenU8 t Inoculate 6 x 900 mL Superbroth (5 mg/mL tetracycline) in 2-L baffled flasks with 5 
20 mL from above culture and grow cells to OD 5 00 = 0-6-0.8 at 37°C, 200 rpm. 

3. Chill three flask on ice (shake periodically). Further steps were performed m a cold 
room, on ice, with prechilled solutions and equipment. 

4. Centrifuge 5.5K/5 min in a SORVALL GS3 ROTOR and decant all supernatant. Add 
culture from remaining three flasks to same tubes; respin and decant. 

25 5. Resuspend in equal volume of 1 mM HEPES, P H7.4 by swirling or stirring. Centnfuge 

5.5K/10 min and decant supernatant. 

6 Resuspend in e^ual volume of 1 mM HEPES as in (5) above. Centrifuge 5.5K/10 mm 

and decant supernatant. Resuspend each pellet in 100 mL of 10% (v/v) glycerol (filter stenhzed; 

ultrapure glycerol (Gibco BRL #15514-01 1)). 
30 7 Centrifuge 5.5K/15 min and decant all supernatant. Resuspend in minimum volume of 

,0% glycerol. Using about 3 mL of 10% glycerol for 5 L of starting culture produces about 12 mL 

of concentrated cells with about 3-4 x 10 1 1 cfu/mL. 

Pv^npl^- Mutagenesis Fill-in ffl 
35 The mutagenesis reaction was conducted using the procedure desenbed in U.S. 5,750,373 

with the changes shown below: 



1) Kinase oligo 
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. 4 pL oligo (330 ng/mL stock; i.e., A26O = 10) 
4 pL 10XTM buffer (0.5M tris pH7.5, 0. 1M MgCl2) 
4pL10mMATP 
2 pL lOOmM DTT 
5 24pLH20 

2 pL kinase (NEB, 10 U/pL) 
40 pL 

-incubate at 37°C for 0.5 hour. 

10 2) Anneal oligo/template 
40 pg kunkel template 

1 .2 pg kinased oligo (i.e. 40 pL from above kinase reaction; oligo/template = 3) 
25 pL 1 OX TM buffer 
add H20 to 250 pL final volume 
1 5 -incubate at 90°C for 2 minutes, 50°C for 3 minutes. 

3) Fill-in 
-add: 

1 pL lOOmM ATP 

10 pL 25mM dNTPs (25mM each dATP, dCTP, dGTP, dTTP) 
15pL100mM DTT 
6 pL T4 ligase (NEB, 400 U/pL) 

3 pL T7 polymerase (NEB, 10 U/pL) 
-incubate at 20°C for 3 hours. 

Example 4 - E. coli Electroporation 

Electroporation was conducted as described below: 

1. Extract fill-in reaction with an equal volume of phenol/CHCl3. Extract with an equal 
volume of CHCI3. Purify and desalt DNA (60 micrograms) using QIAquick gel extraction kit 
(QIAGEN). Use two columns for each reaction. Follow wash and elution procedures as outlined 
by QIAGEN; elute each column with 30 pL of H2O to provide a final theoretical yield of 80 pg 
fill-in product (40 pg single strand DNA converted to double strand) in 60 pL of H2O. 

2. Electroporate DNA (60 pL) into 350 pL of competent E. coli SS320. Use 0.2 cm gaps 
cells with following settings: 2.5 kV, 200 ohms, 25 pF. Use two cells for each reaction (i.e. 200 
pL for each cell). After the shock, transfer cells to 25 mL SOC media and culture for phenotype 
expression. After phenotype expression, remove a small aliquot for titre on selective and non- 
selective media. Transfer cells to 500 mL 2xYT (in a 2-L baffled flask) containing appropriate 

4>3 
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antibiotic forphagemidselectionandVCS helper phage (m.o.i. = 10). Grow overnight and harvest 
phage in the morning. 

Example 5 - 1 ~~ libfiO constructionA ^tnlii.h DNA and F coli concentrations , 
5 The fill-in protocol of Example 3 was followed with two different single stranded 

template S (aandb)andamismatcholigonucleotide. Three different input template quantities were 

used: 1) 20 pg, 2) 30 pg, or 3) 40 pg. 

After the fill-in and purification the following double stranded DNA quantities were 



10 



15 



obtained in 60 pL of water 



Reaction 


OD260 


[DNA], pg/mL 


DNA total, pg 


al 


9.04 


452 


27 


a2 


12.12 


606 


36 


a3 


13.92 


696 


42 


bl 


7.48 


374 


22 


b2 


10.4 


520 


31 


b3 


13.12 


656 


39 



Each reaction was used to electroporate 340 pL of SS320 (3 x 10" cfu/mL). This gives a total 
volume of 400 pL with 1 x 10« cells. Each reaction was electroporated in two 200 microhter 
aliquots. 

Reaction [DNA], pg/mL Time constants, ms* 

al 



a2 
a3 
bl 
b2 
b3 

♦microseconds 



68 4.1,4.1 

90 4.2,4.3 

105 4.2,4.3 

55 4.2,4.2 

78 4.3,4.3 

98 4.2, 4.2 



These results indicate that the time constant for electroporations with high DNA 
concentrations are well above 3.0 ms and that electroporation is easily performed with high DNA 
20 concentrations. 

The number of transformants and the survival of the cells during transformauon was 



evaluated and is shown in the table below. 
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TITER OF LIBRARY SIZE AND SURVIVORS: 



Reaction 


survival 


transform 


S/I* 


T/S** 


T/I*** 


al 


5x 10 10 


2.3 x 10 10 


0.50 


0.46 


0.23 


a2 


5x 10 10 


3.0 x 10 10 


0.50 


0.60 


0.30 


a3 


4.5 x 10 10 


3.0 x 10 10 


0.45 


0.67 


0.30 


bl 


4x 10 10 


2.3 x 10 10 


0.40 


0.58 


0.23 


b2 


4xl0 10 


2.3 x 10!0 


0.40 


0.58 


0.23 


b3 


4x 10 10 


2.3 x 10 10 


0.40 


0.58 


0.23 



* survivors/input is the fraction that survive electroporation 
** transformants/survivors is fraction of survivors with phagemid 
5 ***transformants/input is fraction that both survive and transform. 
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Exam ple 6 - H i ph concente r ™ TtNA electroporation 

The standard fill-in protocol was followed with a single stranded template and a mismatch 
ohgonucleotide. Seven identical reactions were purified and pooled to produce 400 U L of DNA at 

750 ug/mL (OD260 = 1 5.0) 

Various amounts of DNA were electroporated into E. coli SS320 in a final volume of 200 

ML with a fixed E. coli concentration of 1.5 x 1 0 1 ^cells/mL. The following conditions were used: 
0 2 cm cuvettes @ 2.5 kV/cm, 200 ohms, 25 pF. After electroporation, the reaction was grown m 
10 mL SOC media for 30 minutes and then titered on both LB (survival) and LB/carbenicillin (50 
Mg/mL) (transformation). 



15 





[DNA] 


T.C. 


LB 


carb 


S/I* 


IIS** 


111*** 




(Mg/mL) 


(ms) 


(cfu) 


(cfu) 








1 


375 


3.6 


1.4 x 10 10 


6.0 x 10 9 


.47 


.43 


.20 


2 


281 


3.8 


1.4 xlO 10 


7.0 x 10 9 


.47 


.50 


.23 


3 


188 


4.1 


1.5 xlO 10 


6.2 xlO 9 


.50 


.41 


.21 


4 


150 


4.1 


1.4 xlO 10 


6.0 x 10 9 


.47 


.43 


.20 


5 


113 


4.2 


1.4 xlO 10 


5.7 x 10 9 


.47 


.41 


.19 


6 


75 


4.3 


1.5 xlO 10 


5.9 x 10 9 


.50 


.42 


.20 


7 


38 


4.4 


1.6 xlO 10 


6.8 x 10 9 


.53 


.43 


.23 


8 


19 


4.4 


1.6 xlO 10 


7.0 x 10 9 


.53 


.44 


.23 


9 


7.5 


4.5 


1.4 xlO 10 


5.0 x 10 9 


.47 


.36 


.17 


10 


3.8 


4.5 


1.5 x 10 10 


2.4 xlO 9 


.50 


.16 


.08 


11 


0 


4.5 


1.5 xlO 10 


0 


.50 


0 


0 



* survivors/input is the fraction that survive electroporation 
transformants/survivors is fraction of survivors with phagemid 
transformants/input is fraction that both survive and transform. 



** 
*** 
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acarn elg 7 - Censtju e tior. ofafl extSfflgly. lattfi library using multip le ^tro porations with 
■iltrahi ph DNA and E. coli c oncentrations 

The fill-in protocol of Example 3 was followed with a single stranded template and a 
mismatch oligonucleotide. The quantity of input template was 40 micrograms, and 35 identical 

reactions were performed. 

After the fill-in, purification was conducted as described in Example 4 except that the 
extractions with phenoVCHCl 3 and CHC1 3 were omitted, and each column was eluted with 50 
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microL of H2O. Two columns were used for each reaction, and thus, the final theoretical yield for 
each reaction was 80 micrograms of fill-in product in 100 microL of H2O. 

Each reaction was used to electroporate 700 microL of SS320 (3 x 10 1 1 cfu/mL). This 
gives a total volume of 800 microL with 2 x 10 1 1 cells. Each reaction was electroporated in two 
5 400 microL aliquotes as described in Example 4, except that the cells were transferred to 50 mL of 
SOC media after the shock. After phenotype expression, the cells were titred on selective media. 
When combined together, the 35 independent reactions provided a library size of 1.79 x 10 12 
different members. The results for the 35 independent reactions are presented below: 



AYvawllVSll 


transform ant <; 


1 


^ o x inlO 

J .1/ A IV/ 


2 


4 6 x 10l0 

*T.U A AVI 


3 


4 6 v IfilO 


4 


S 4 y I0l0 


5 


Ov mlO 


6 


5.8 x lO 1 ^ 


7 


4.6 x lO 1 ^ 


8 


5.0 x lO 1 ^ 


9 


5.0 x 10 i0 


10 


4.6 x 10 10 


11 


^ o y inlO 


12 


5.6 x 10 10 


13 


5.6 x 10 10 


14 


5.0 x 10 10 


15 


4.6 x 10* 0 


16 


4.6 xlO 10 


17 


5.8 x 10 10 


18 


5.6 x 10 10 


19 


5.0 xlO 10 


20 


5.0 xlO 10 


21 


4.8 xlO 10 


22 


4.8 x 10 10 


23 


5.0 xlO 10 



(f1 
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24 


5.2 x 10 10 


25 


5.6 xlO 10 


26 


4.6 x 10 10 


27 


5.4 xlO 10 


28 


5.4 x 10 10 


29 


5.8 x 10 10 




4.6 x 10 10 


31 


?nv \r\lO 


32 


5.4 xlO 10 


33 


5.8 xlO 10 


34 


5.0 xlO 10 


35 


4.8 x 10 10 



Materials for farther Examples 

Reagents for dideoxynucleotide sequencing were from United States Biochem.cals. 
Enzymes and plasmid P Mal- P 2 were from New England Biolabs. Maxisorp immnnoplates were 
from NUNC (Roskilde, Denmark). E. coli XLl-Blue was from Stratagene; the construct of E. 
coli SS-320 is described above. Bovine serum albumin (BSA), Tween 20, and o- 
phenylenediamine dihydrochloride were from Sigma. HRP/anti-M13 conjugate was from 
Pharmacia Biotech. Streptomyces avidinii was from ATCC (accession no. 2741 9). Goat ant- 
streptavidin polyclonal antibody was from Zymed Laboratories (South San Francisco, USA). 



nii r nn,ir.le.otides for Examples 8-21 

DNA degeneracies are represented in the IUB code (K = G/T, N = A/C/G/T, R = A/G, S - G/C, W 
= AT,Y = C/T). 

fPTG .j. AAAAGAATTCCCGACACCATCGAATGGTGC (SEQ ID NO. 9) 
15 IPTG^ ACCAGATGCATAAGCCGAGGCGGAAAACATCATCG (SEQ ID NO. 10) 
fPTG-3: TTTTCTAGACAGGCCTCCCACCAGATGCATAAGCCGAGGCGGAAAA 

CATCATCGTC (SEQ ID NO. 1 1) 

SAVdL GGCTATCGGAATGCATCGGGCATCACCGGCACCTG (SEQ ID NO. 12) 
SAV-2: GAGTCATAGTCGTCAGGCGCCTCCTCCGGATCCTCCACCCACCTT 

20 GGTGAAGGTGTCGTGG (SEQ ID NO. 13) 

hGH .j- GGGTATCTAGAGGTTGAG (SEQ ID NO. 14) 

hGH-2: TGGAGCTCCCGGATCCTCCACCGCTCTGGAAGCCACAGCTGCCCTC 
(SEQ ID NO. 15) 
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e8stovl\ GGATCCGGGAGCTCCAGCTGATGAGGTGACGATCCCGCAAAA (SEQ ID 
NO. 16) 

e8stop2: GATCCCGCAAAAGCGGCCTGATGATCCCTGCAAGCCTCAGCG (SEQ ID NO. 
17) 

5 s8stov3: CAAGCCTCAGCGACCGAATGATGAGGTTATGCGTGGGCGATG (SEQ ID NO. 
18) 

e8stop4: GCGTGGGCGATGGTTGTTTGATGAGTCGGCGCAACTATCGGT (SEQ ID NO. 
19) 

z8stop5: GCAACTATCGGTATCAAGTGATGAAAGAAATTCACCTCGAAA (SEQ ID NO. 
10 20) 

e8Vl: GGATCCGGGAGCTCCAGCRNTNASRNTNASNASNYCRNTRNARNT 
RNTTTTAACTCCCTGCAAGCC (SEQ ID NO. 21) 

s8V2: GATCCCGCAAAAGCGGCCNWTNASRNTNYTNASRNTRNTRNTRNTN 
ASTATATCGGTTATGCGTGG (SEQ ID NO. 22) 
15 e8V3: CAAGCCTCAGCGACCGAANWCNWCNKTNWCNYYTNKGNYTNKGN 
WTNWTGTCATTGTCGGCGCAACTATC (SEQ ID NO. 23) 
e8V4: GCGTGGGCGATGGTTGTTNWTNWCNWTNKTNYTNYTNNTNNTO 
AAGCTGTTTAAGAAATTCACC (SEQ ID NO. 24) 

e8V5: GCAACTATCGGTATCAAGNNGNNSAAGAAANNSNNGNNGAAANN 
20 GNNGTGATAAACCGATACAATTAAAGGC (SEQ ID NO. 25) 

e8(la): GATCCCGCAAAAGCGGCCTATGAGGCTCTTGAGGATATTGCTAC 
TAACTATATCGGTTATGCGTGG (SEQ ID NO. 26) 

R64A: CCGACACCCTCCAATGCTGAGGAAACACAACAGAAA (SEQ ID NO. 27) 
D171A: TTCAGGAAGGACATGGCTAAGGTCGAGACATTCCTG (SEQ ID NO. 28) 
25 Y164A/R178A: AACTACGGGCTGCTCGCTTGCTTCAGGAAGGACATGGACAAGG 
TCGAGACATTCCTGGCTATCGTGCAGTGCCGC (SEQ ID NO. 29) 
K172A/R178A: TTCAGGAAGGACATGGACGCTGTCGAGACATTCCTG 
GCTATCGTCCAGTGCCGCTCT (SEQ ID NO. 30) 

Lstop: GGTGGAGGATCCGGGAGCTGATGAGCCGAGGGTGACGATCCC (SEQ ID NO. 31) 
30 Lstop2: CACCAAGGTGGTCTAGAGCTAATAATAAGCCGAGGGTGACGA TCCC (SEQ ID 
NO. 32) 

LELGAGGGCAGCTGTGGCTTCGGTGGCGGTVVCVVCVVCVVCVVCV (SEQ ID NO. 33) 
VCWCWCWCWCWCWCWCVVCGGCGGTGCCGAGGGTGACGATCCC (SEQ ID 
NO. 34) 

35 LVSi CACCAAGGTGGTCTAGAGCVVCWCVVCWCWCGCCGAGGGTGACGATCCC 
(SEQ ID NO. 35) 

Z,F7Q. CACCAAGGTGGTCTAGAGCWCWCWCWCWCWCWCW 
CWCWCGCCGAGGGTGACGATCCC (SEQ ID NO. 36) 
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LV15: CAC CAAGGTGGTCTAGAGCWCWC WCWCVVCWC WCW 
CWCWCWCWCWCWCWCGCCGAGGGTGACGATCCC (SEQ ID NO. 37) 
LV2Q- CACCAAGGTGGTCTAGAGCWCWCWCWCVVCWCWCW 
CWCWCWCWCWCWCWCWCWCWCWCWCGCCGAGGGTGACGATCCC 



5 (SEQ ID NO. 38) 

LV25: CACCAAGGTGGTCTAGAGCWCWCVVCWC WCWCWCW 

cwcwcvvcwcwcwcwcwcwcwcwcvvcwcwcw 

CWCWCGCCGAGGGTGACGATCCC (SEQ ID NO. 39) 

?8 V2c: AAGTTCGCTAGAGATGCTTATGAGGCTCTTGAGGATATTGCTAC 

10 TAACTATATCGGTTATGCGTGG (SEQ ID NO. 40) 

9 8V3c: GAGGATATTGCTACTAACCTTTTCTTTCTCCTTGGGACTGTGC 

ATCTTGTCATTGTCGGCGCAACT (SEQ ID NO. 41) 

0 ,y 2 .E12N: GCAAAAGCGGCCTATAACGCTCTTGAGGATATT (SEQ ID NO. 42) 
^V2.D16A: TATGAGGCTCTTGAGGCCATTGCTACTAACTAT (SEQ ID NO. 43) 
15 ^V2-I17S: GAGGCTCTTGAGGATTCAGCTACTAACTATATC (SEQ ID NO. 44) 
p8 V2b: GATCCCGCAAAAGCGGCCTATGAGGCTCTTGAGGATATTGCTA 

CTAACTATATCGGTTATGCGTGG (SEQ ID NO. 45) 

L-wt: GAGGGCAGCTGTGGCTTCCAGAGCGGTGGAGGATCCGGGAG 

CTCCAGCGCCGAGGGTGACGATCCC (SEQ ID NO. 46) 
20 S13A/S17I: CCCGCAAAAGCGGCCTTTAACGCTCTGCAAGCCATTGCGACC 

GAATATATCGGTTATGCG (SEQ ID NO. 47) 

g 8V3b: CAAGCCTCAGCGACCGAACTTTTCTTTCTCCTTGGGACTGTGCATC 
TTGTCATTGTCGGCGCAACT (SEQ ID NO. 48) 

DIA: TCCGGGAGCTCCAGCGCCAAGAGTGAGAAGTTC (SEQ ID NO. 49) 
25 K2E: GGGAGCTCCAGCGATGAGAGTGAGAAGTTCGCT (SEQ ID NO. 50) 
S3G: AGCTCCAGCGATAAGGGTGAGAAGTTCGCTAGA (SEQ ID NO. 51) 
E4D: TCCAGCGATAAGAGTGACAAGTTCGCTAGAGAT (SEQ ID NO. 52) 
K5D: AGCGATAAGAGTGAGGATTTCGCTAGAGATGCT (SEQ ID NO. 53) 
F6P: GATAAGAGTGAGAAGCCCGCTAGAGATGCTTTT (SEQ ID NO. 54) 
30 R8K: AGTGAGAAGTTCGCTAAAGATGCTnTAACTCC (SEQ ID NO. 55) 
D9A: GAGAAGTTCGCTAGAGCGGCTTTTAACTCCCTG (SEQ ID NO. 56) 
YIIF: CCCGCAAAAGCGGCCTTTGAGGCTCTTGAGGAT (SEQ ID NO. 57) 
E12N: GCAAAAGCGGCCTATAAACGCTCTTGAGGATATT (SEQ ID NO. 58) 
A13S: AAAGCGGCCTATGAGTCCCTTGAGGATATTGCT (SEQ ID NO. 59) 
35 E15Q: GCCTATGAGGCTCTTCAAGATATTGCTACTAAC (SEQ ID NO. 60) 
D16A: TATGAGGCTCTTGAGGCCATTGCTACTAACTAT (SEQ ID NO. 61) 
77 7S: GAGGCTCTTGAGGATTCAGCTACTAACTATATC (SEQ ID NO. 62) 
N20E GAGGATATTGCTACTGAATATATCGGTTATGCG (SEQ ID NO. 63) 

10 
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L2JY: GCCTCAGCGACCGAATATTTCTTTCTCCTTGGG (SEQ ID NO. 64) 

F22I: TCAGCGACCGAACTTATCTTTCTCCTTGGGACT (SEQ ID NO. 65) 

F23G: GCGACCGAACTTTTCGGTCTCCTTGGGACTGTG (SEQ ID NO. 66) 

L24Y: ACCGAACTTTTCTTTTATCTTGGGACTGTGCAT (SEQ ID NO. 67) 

L25A: GAACTTTTCTrTCTCGCGGGGACTGTGCATCrT (SEQ ID NO. 68) 

G26W: CTTTTCTTTCTCCTTTGGACTGTGCATCTTGTC (SEQ ID NO. 69) 

T27A: TTCTTTCTCCTTGGGGCGGTGCATCTTGTCATT (SEQ ID NO. 70) 

V28M: TTTCTCCTTGGGACTATGCATCTTGTCATTGTC (SEQ ID NO. 71) 

H29V: CTCCTTGGGACTGTGGTTCTTGTCATTGTCGGC (SEQ ID NO. 72) 

L30V: CTTGGGACTGTGCATGTTGTCATTGTCGGCGCA (SEQ ID NO. 73) 

E12N/A13S: GCAAAAGCGGCCTATAACTCCCTTGAGGATATTGCT (SEQ ID NO. 74) 

EJ2N/I17S: GCAAAAGCGGCCTATAACGCTCTTGAGGATTCAGCTACT 

AACTATATC (SEQ ID NO. 75) 

AJ3S/I17S: CCCGCAAAAGCGGCCTATGAGTCCCTTGAGGATTCAGCTACT 
AACTATATCGGTTATGCG (SEQ ID NO. 76) 

E12N/A13S/I1 7S: GCAAAAGCGGCCTATAACTCCCTTGAGGATTCAGCT 
ACTAACTATATC (SEQ ID NO. 77) 

Example 8 - pS349: A phagemid for phage display of hGH on Protein VIIL 

A DNA fragment containing the gene for hGH was amplified using the PCR (with a 
derivative of plasmid pB0475 (Cunningham, B. C, Jhurani, P., Ng, P., and Wells, J. A. (1989) 
Science 243:1330-1336) as template and oligonucleotides hGH-1 and hGH-2 as primers). The 
DNA fragment was digested with Nsil and cloned into a protein VIII display phagemid (Lowman, 
H. B., Chen Y. M., Skelton, N. J., Mortensen, D. L M Tomlinson, E. E., Sadick, M. D., Robinson, I. 
C. A. F., and Clark, R. G. (1998) Biochemistry 37:8870-8878) which had been first digested with 
Kasl and treated with T4 DNA polymerase to produce blunt ends and subsequently digested with 
Afcil. The resulting phagemid was designated pS 135a. 

The PCR was used with primers IPTG-1 and IPTG-2 to amplify a 1.6 kbp fragment of 
pMal-p2 (New England Biolabs Product Catalog (1996-97) p212) containing the lacB gene and a 
gene fragment encoding the signal peptide from maltose binding protein under the control of the 
Ptac promoter. The DNA fragment was digested with £coRI and Nsil and ligated with the large 
fragment resulting from a similar digestion of pS135a. The resulting phagemid (designated pS349) 
contains a gene encoding a fusion product (the maltose binding protein signal peptide, followed by 
hGH, followed by a Gly/Ser-rich linker peptide (QSGGGSGSSS) (SEQ ID NO. 78), and protein 
VIII of E. coli bacteriophage M13 under the control of the IPTG-inducible Ptac promoter (New 

England Biolabs). In addition, pS349 also contains the lacft gene for effective transcription 
repression in the absence of EPTG. 
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A derivative of pS349 was constructed and designated P S657a. P S657a differs from 
P S349 in two respects. Firstly, the gene encoding hGH has bee. replaced by sequence encoding a 
pentapeptide (GGRPV) (SEQ ID NO. 79). Secondly, the introduction of an Xbal site in the linker 
5 precedingproteinvTIIhaschangedthecodonencodingglutamine to an amber (TAG) stop codon. 
Digestion with Nsil and Xbal excises the pentapeptide-encoding sequence and allows for the 
directional cloning of appropriately digested DNA fragments into a position analogous to that of 



thehGHgeneinpS349. 

A PCR was performed with Streptomyces avidinii genomic DNA as template and 
) oligonucleotides SAV- 1 and SAV-2 as primers. The amplified DNA fragment contained codons 16 
to 133ofthestreptavidin (SAV) gene open reading frame flanked by an Nsil site at the 5' end and 
an Xbal site at the 3' end. The fragment was digested with Nsil and Xbal and cloned into similarly 
digested phagemid P S657a. The resulting phagemid (pW277e) encodes a fusion product suntlar to 
that encoded by pS349 except that hGH has been replaced by SAV. Also, an amber codon has 
5 been positioned between the segments encoding SAV and protein Vffl. 

p^ pi. i n . ronstnictio " »f mutant protein VTTI libraries. 

For library construction, protein WI was divided into five zones encompassing 
approximately 10 contiguous residues each (zone 1, residues 1 to 10; zone 2, residues 1 1 to 20; 
,0 zone 3, residues 21 to 30; zone 4, residues 31 to 39; zone 5, residues 40 to 50). Libraries were 
constructed using a modified version (SS320 described above) of a previously descnbed method 
(Lowman, H. B. (1998) Phage Display of Peptide Libraries on Protein Scaffolds. From: Methods 
in Molecular Biology, vol. 87: Combinatorial Peptide Library Protocols. Edited by. S.Cabdly. 
Publisher Humana Press Inc., Totowa,NJ). Briefly for each zone, an oligonucleotide (gSstopn, 
25 where V is the zone number) was used with either pS349 (for hGH display) or pW277e (for SAV 
display) as template to introduce two consecutive TGA stop codons within the zone, using the 
method of Kunkel (Kunkel, T. A. (1985) Proc. Natl. Acad. Sci. USA 82:488-492). The resulting 
phagemid was used as template in a second round of the Kunkel method with a degenerate 
oligonucleotide (g8Vn, where V is the zone number) designed to introduce mutatrons at the 
30 desired sites. 

Libraries were constructed foreach zone of the protein Vm moiety within the hGH- 
protein Vffl fusion product of P S349. The diversities of these libraries were as Mows: zone-1, 
2 5 x 1010- zone-2, 2.5 x 10^; zone-3, 2.5 x 10">; zone-4, 1.3 x 10l0; and zone-5, 5.0 x 10*. 
Libraries were constructed for zone-1, zone-2, and zone-3 of the protein Vffl moiety within the 
35 SAV- protein Vffl fusion product of pW277e. The diversities of these libraries were as follows: 
zone-1, 3.0 x \0 9 ; zone-2, 6.8 x 10 9 ; and zone-3, 8.6 x 10 9 . 

Libraries were also constructed to vary the linker between the fused protein and protein 
Vffl. For hGH display, the oligonucleotide Lstopws used to introduce two consecutive TGA stop 
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codons within the linker between hGH and protein VIII. The resulting phagemid was used as 
template in a second round of the Kunkel method with a degenerate oligonucleotide (LV) designed 
to introduce a linker of the form (Gly)3(Xaa) l4(Gly)2 (where Xaa is a variable position), in place 
of the Gly/Ser-rich linker encoded by pS349. For SAV display, the linker was varied between 
5 SAV and variant protein VIII (le) (see Figure 2, for the sequence of protein VIII (le)). The 

oligonucleotide Lstopl was used to introduce three consecutive TAA stop codons within the linker 
between SAV and protein VIII ( le). The resulting phagemid was used as template for the 
production of libraries with linkers of variable length and sequence. The oligonucleotides LV5 t 
LV10, LV15, LV20, and LV25 were used to construct libraries with linkers containing 5, 10, 15, 20, 
10 or 25 variable residues, respectively. 

The diversities of the linker libraries were as follows: hGH-LV- protein VHI, 1.8 x 10 10 ; 
SAV-LV5- protein VHI, 1.4 x 10 10 ; SAV-LV10- protein VIII, 9.8 x 10 9 ; SAV-LV15- protein 
VIII, 1.2 x 10 10 ; SAV-LV20- protein VIII, 1.1 x 10 10 ; SAV-LV25- protein VIII, 6.0 x 10 9 . 

15 Example 1 1 - Selection of protein VIII variants which increase fusion protein display . 

Phage from the hGH- protein VIII libraries described above were cycled through rounds of 
binding selection withhGHbp (Fuh, G. etai (\99Q)J.Biol. Chem. 265:31 1 1; Cunningham, B. C, 
Ultsch, ML, De Vos, A. M„ Mulkerrin, M. G., Clauser, K. R., and Wells, J. A. (1991) Science 
254:821-825) coated on 96-well Maxisorp immunoplates as a target. All libraries were sorted 

20 separately. Phage were propagated in E. coli SS320 cells with M13-VCS helper phage 

(Stratagene). After five rounds of binding selection, individual phage were isolated and analyzed 
for hGH display using a phage ELISA with hGHbp as target (see below). Phage exhibiting strong 
signals in the phage ELISA were sequenced (Sanger, F. et ah (1979) Proc. Natl Acad.ScL USA, 
74:5463-5467). 

25 The SAV- protein VIII libraries were pooled and binding selection was performed as 

described above for the hGH- protein VIII libraries, except that the binding target was an anti-SAV 
polyclonal antibody. Phage were propagated in the SupE E. coli strain XL 1 -Blue in which the 
amber stop codon is suppressed as glutamine (Bullock, W. O., Fernandez, J. M., and Short, J. M. 
(1987) Biotechniques 5:376-379). 

30 

Example 12 - Site-directed mutagenesis . 

Mutageneses were performed using the method of Kunkel (Kunkel, et ai (1987) Meth. 
Enzymol. 154:367-382). Template DNA was prepared by growing an appropriate plasmid (e.g., 
containing the hGH gene fused to the carboxy-terminal half of Ml 3 gene IE) in host cells with 
35 M 1 3-K07 phage added as helper phage. Single-stranded, uracil-containing DNA was prepared for 
mutagenesis to introduce the desired mutation into the hGH-protein VIII gene fusion. 
Oligonucleotide-directed mutagenesis was carried out using T7 DNA polymerase and the 
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appropriate oligodeoxy-nucleotide, Clones from the mutagenesis were confirmed by dideoxy 

DNA sequencing. rw - 

He mutagenic oligonucleotide g 8(l a) was used to introduce the protein VHI mutaUons of 
se .ectanthGH-proteinVni(la)intophagemidpW277e. Mutant hGH genes were constructed 
using the appropriately named oligonucleotide (e.g.. oligonucleotide R64A encodes the mutaUon of 

Arg64toAla). . 

The mutagenic oligonucleotide S 8V2c was used to introduce the mutations of protem 
Vffl(2a) into the gene encoding protein Vffi(l a). He mutagenic oligonucleotide g 8V3c was used 
to introduce the mutations of protein Vffl(3a) into the gene encoding protein Vni(2a). The 
mutation E12N, D16A, or I17S was introduced into the gene encoding protein Vffl(2a) usmgthe 
mutagenic oligonucleotide g«K2-£J2tf, g8V2-D16A, or g 8V2-I17S. 

g^Elelj Phage H TS As for det< rminjng Hntivr levels of fusion protein disp la y on protein 

vttt »n<\ protein VTTT variants thereof. 

Cultures of E. coli XLl-Blue (BuUock, W. O., Fernandez, J. M., and Short, J. M. (1987) 

Biotechnia.es 5:376-379) harboring phagemids were grown for 8 hours at 37°C in 1 mL of 2YT, 
SOpg/mLcarbenicillin, 10 pg/mL tetracycline. The cultures were transferred to 30 mL of the 
same media ( supplemented with M13-VCS helper phage (lO* phage/mL) and IPTG at the 
appropriate concentration) for overnight growth at 37»C. Phage were harvested from the cuhure 
supenuttantbyprecip^^ 

Peptide Libraries on Protein Scaffolds. From: Methods in Molecular Biology, vol. 87: 
CornMnatorialPeptideLitraryProtocols. Edited by: S. Cabilly. Publisher: Humana Press Inc., 
Totowa, NJ) and .suspended in 1 mL of PBS, 0.2% BSA, 0.1% Tween (BSA blockmg buffer). 
Phage concentmtions were determined spectrophotometrically (e 2 68= Ux lOSM-W ). 

Maxisorp immunoplates (96-well) were coated with target protein for 2 hours at room 
tempemtu^lOOpLatS^ 

blocked for 1 h with 0.2% BSA in phosphate-buffered saline (PBS) and washed (8X) with ^ 
0.05% Tween 20. Phage particles were diluted serially using PBS, 0.2% BSA, 0.1% Tween 
blockingbuffer)andthen transferred (100 pL) to coated well, After 1 h, plates were washed (8X) 
, with PBS, 0.05% Tween 20, incubated with 100 pL of 1:3000 HRP/anti-M13 conjugate in BSA 
blocking buffer for 30 min, and then washed with PBS, 0.05% Tween 20 (8X) and PBS <2X> 
Plates were developed using an o-phenylenediamine dihydrochloride/H 2 0 2 solution (100 pL), 
stopped with 2.5 M H 2 S0 4 (50 pL), and read spectrophotometrically at 492 nm. Li, B. et al., 
(1995) Science 270:165-1660. 
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Example 14 - display of SAV on a protein Vm variant selected for increased display of hGH . 

The method of Kunkel (Example 12) was used with the mutagenic oligonucleotide g8V2b 
to introduce the protein VIII mutations of seiectant hGH-protein VIII(2a) into phagemid pW277e. 
The resulting phagemid encoded a fusion protein identical to that encoded by pW277e except that 
5 the protein Vm within the fusion moiety contained the mutations of variant protein VIII(2a). SAV 
display was measured by phage ELISAs (Example 13) with either anti-SAV polyclonal antibody 
(Fig. 4a) or biotinylated BSA (Fig. 4b) as target. 

Example 15 - display and detection of hGH variants with attenuated binding affinities . 

10 hGH mutants with reduced site 1 binding affinity for hGHbp (Pierce et al., above) were 

displayed as fusions to either wild type protein Vm or variant protein VTII(la). Mutant hGH genes 
were constructed using the method of Kunkel (Example 12) with the following mutagenic 
oligonucleotides: hGH(R64A), oligonucleotide R64A; hGH(D171 A), oligonucleotide DJ7JA; 
hGH(Y164A/R178A), oligonucleotide Y164A/R178A; hGH(K172A/R178A), oligonucleotide 

1 5 Kl 72A/R1 78 A. For display of hGH fused with wild type protein VIII, the mutagenesis template 
was pS349. For display of hGH fused with variant protein VTII(la), the template was a derivative 
of pS349 encoding a fusion protein consisting of hGH fused to protein VIII(la). hGH display was 
measured by phage ELISA (Example 13) with the hGHbp as target (Fig. 3). 

20 Example 16 - display of hGH with protein VIII variants combining mutations in different zones . 

The method of Kunkel (Example 12) was used to combine mutations from protein VIII 
variants independently selected for increased hGH display. The oligonucleotide g8V2c was used to 
introduce the mutations of protein VIH(2a) into the gene encoding protein VIII(la). The 
oligonucleotide g8V3c was used to introduce the mutations of protein VIH(3a) into the gene 

25 encoding protein VIII(2a). hGH display was measured by phage ELISA (Example 13) with an 
anti-hGH monoclonal antibody as target (Fig. 5). 

Example 17 - display of hGH with protein Vm variants derived from protein Vni(2a) through 
back mutations to the wild type sequence . 
30 The method of Kunkel (Example 12) was used to introduce the mutation E12N, D16A, or 

I17S into the gene encoding protein Vm(2a) using the mutagenic oligonucleotide g8V2-E12N t 
g8V2-D16A t or g8V2-I17S> respectively. hGH display was measured by phage ELISA (Example 
13) with an anti-hGH monoclonal antibody as target (Fig. 6). 

35 Example 18 - display of hGH with phagemid pS1607 . 

Further sequence analysis revealed that the fusion-protein VIII gene contained within the 
pS349 clone used in Figures 2, 3, 5, and 8 had a deletion of five base pairs consisting of the final 
base pair encoding the linker between hGH and protein Vm and the first four base pairs encoding 
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protein Vm. This deletion introduced a frameshift which reduced hGH display. The method of 
Kunkel (Example 12) was used with the mutagenic oligonucleotide Z-vW to correct the frameshifl 
inpS349. Theresultingphagemidwasd e signatedpS1607. Phagemid pS 1607 differs from pS349 
only in the addition of five base pairs to correct this frameshift. In pS349, the sequence following 
5 the hGH gene is as follows: C AGAGCGGTGGAGGATCCGGGAGCTCC AGAGGGT (the 

underlined bases are part of the beginning of the protein Vffl gene) (SEQ ID NO. 80). In P S1607, 
the corresponding sequence is as follows: 

CAGAGCGGTGGAGGATCCGGGAGCTCCAgcgccGAGGGT (lower case indicates the bases 
inserted through mutagenisis with oligonucleotide L-wt) (SEQ ID NO. 8 1 ). hGH display was 
10 measured by phage EUSA (Example 13) with the hGHbp as target (Fig. 9). 

Fvsm plB 19 - Fab display with a pro trin VTII variant. 

Phagemid pS1705a directs secretion of a free Fab light-chain and a Fab heavy-chain fused 
to protein Vffl. The Fab heavy-chain also contains a peptide flag (MADPNRFRGKDL) (SEQ ID 

15 NO. 82) fused to its N-tenninus which can be detected with a specific monoclonal antibody. The 
method of Kunkel (Example 12) was used with the mutagenic oligonucleotide S13A/S17I and the 
template pS1705a, the resulting phagemid was designated pS1709b. Phagemid pS1709b is 
identical to P S1705a except that the protein Vffl gene linked to the Fab heavy-chain contains the 
mutations S13A/S17I which were observed in protein Vffl variants selected for increased display 

20 of hGH (Fig. IB). Fab display was measured by phage ELISA (Example 13) with a peptide flag- 
specific monoclonal antibody as target. Fab display with P S1709b was greater than Fab display 
with P S1705a (Fig. 10). Thus protein Vffl mutations which were selected for increased display of 
hGH also increased Fab display. 

25 Example ™ - Hisnlav ojhGJI wjflj nrotein Vffl variants combining mutations in different zones . 

The method of Kunkel (Example 12) was used to combine mutations from protein Vffl 
variants independently selected for increased hGH display. The oligonucleotide g 8V2c was used to 
introduce the mutations of protein Vffl(2a) into the gene encoding protein Vffl(la). The 
oligonucleotide g8V3c was used to introduce the mutations of protein Vffl(3a) into the gene 

30 encoding protein Vffl(2a). The oligonucleotide g8V3b was used to introduce the mutations of 
protein Vffl(3a) into the gene encoding protein Vffl(la). The oligonucleotide g 8V3c was used to 
introduce the mutations of protein VIII(3a) into the gene encoding protein Vffl containing the 
mutations of protein Vffl(la) and protein Vffl(2a). hGH display was measured by phage EUSA 
(Example 13) with an anti-hGH monoclonal antibody as target. All protein Vffl variants increased 
35 hGH display in comparison with wild-type protein Vffl (Fig. 1 1). 
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Exam ple 21 - m odulated display of hGH with protein VIII variants derived from protein VIP 
selectants through back mutations to the wild-type sequence . 

Back mutation scanning is the independent conversion of each mutation within a coat 
protein, such as protein VIII, variant back to the wild-type sequence. A protein VTII selectant with 
mutations in either zone 1, 2, or 3 was subjected to back mutation scanning analysis. The 
following selectants were analyzed: protein VDI(la), protein VHI(2a), and protein VHI(3a) (Fig. 
1). The method of Kunkel (Example 12) was used to mutate each mutation in a given variant back 
to the wild-type sequence. Appropriately designed and named oligonucleotides were used (e.g. the 
oligonucleotide D1A mutates Aspl in protein VIII(la) to Ala). In addition, double and triple back 
mutations were introduced into protein VIII(2a), again using appropriately designed and named 
oligonucleotides (eg., the oligonucleotide A13S/I17S simultaneously mutates A 13 and 117 to Ser). 

hGH display was measured by phage EUSA (Example 13) with an anti-hGH monoclonal 
antibody as target. Some of the back mutations reduced hGH display, allowing for the modulation 
of hGH display (Fig. 12). 

Oligonucleotides for Examples 22-25 

Add-NX: GATGGTGAAGCTGCGGCTGATGCATCTGGTAGCGTCTAGAGC 
CACCATCACCATCACCAT (SEQ ID NO: 83) 

add-P12-7: GCTGTCGGTATTATTTACATGCTCCTCGTGGAGGCGTCGCCC 
TGGGCTGCTAAGGCGCCA (SEQ ID NO: 84) 

G-0: ACCTCGAAAGCAAGCCATCACCATCACCATGCG (SEQ ID NO: 85) 
G-7: ACCTCGAAAGCAAGCGGCCATCACCATCACCATGCG (SEQ ID NO: 86) 
G-2: ACCTCGAAAGCAAGCGGTGGCCATCACCATCACCATGCG (SEQ ID NO: 87) 
G-3: ACCTCGAAAGCAAGCGGTGGTGGCCATCACCATCACCATGCG (SEQ ID NO: 88) 
G-4: ACCTCGAAAGCAAGCGGCGGTGGTGGCCATCACCATCACCATGCG (SEQ ID NO: 
89) 

G-6: ACCTCGAAAGCAAGCGGTGGTGGCGGTGGTGGCCATCAC 
CATCACCATGCG (SEQ ID NO: 90) 

G- 7; ACCTCG AAAGC AAGCGGCGGTGGTGGCGGTGGTGGCCATC AC 
CATCACCATGCG (SEQ ID NO: 91) 

G8: ACCTCGAAAGCAAGCGGTGGCGGTGGTGGCGGTGGTGGCCATCACCATCACCAT 
GCG (SEQ ID NO: 92) 

G-9: ACCTCGAAAGCAAGCGGCGGTGGCGGTGGTGGCGGTGGTGGCCATCAC 
CATCACCATGCG (SEQ ID NO: 93) 

G-10: ACCTCGAAAGCAAGCGGTGGCGGTGGCGGTGGTGGCGGTGGTGGC 
CATCACCATC ACCATGCG (SEQ ID NO: 94) 

G-12: ACCTCGAAAGCAAGCGGTGGCGGTGGCGGTGGCGGTGGTGGCGGT 
GGTGGCCATCACCATCACCATGCG (SEQ ID NO: 95) 

"77 
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G-14: ACCTCGAAAGCAAGCGGTGGTGGTGGCGGTGGCGGTGGCGGT 
GGTGGCGGTGGTGGCCATCACCATCACCATGCG (SEQIDNO: 96) 
G-16: ACCTCGAAAGCAAGCGGCGGCGGTGGTGGTGGCGGTGGCGGTGGC 
GGTGGTGGCGGTGGTGGCCATCACCATCACCATGCG (SEQ ID NO: 97) 
5 G-18- ACCTCGAAAGCAAGCGGCGGTGGCGGCGGTGGTGGTGGCGGTGGC 

GGTGGCGGTGGTGGCGGTGGTGGCCATCACCATCACCATGCG (SEQ ID NO: 98) 
G-20- ACCTCGAAAGCAAGCGGTGGTGGCGGTGGCGGCGGTGGTGGTGGC 
GGTGGCGGTGGCGGTGGTGGCGGTGGTGGCC ATC ACCATCACC ATGCG (SEQ ID NO: 

99) 

,0 Lib-zonel: CAAGGACCATAGATTATGNNSNNSNNSNNSNNSNNSAAGTT 
TCTGAAAGTTTTTGTTTTT (SEQIDNO: 100) 

Lib-zonel: ATTATGAGCAAGAGCACTNNSNNSNNSNNSNNSNNSGTT 
TTTGTTnTTCTGTTGAT (SEQIDNO: 101) 

Lib-zonei: TTCAAAAAGTTTCTGAAANNSNNSNNSNNSNNSNNSNNSNNS 
15 NNSNNSAATTGGATTTGGGCTGTCGGT (SEQ ID NO: 102) 

Lib-zone4: GTTrTTTCTGTTGATGTTGATNNSNNSNNSNNSNNSNNSNNS 
NNSNNSNNSGCGGCTGATGCATTCCCA (SEQIDNO: 103) 
Lib-zoneS: TGGGCTGTCGGTATTATTNNSNNSNNSNNSNNSNNSNNSNNS 
NNSNNSGCTGCTAAGGCGCCAGACGATGGT (SEQIDNO: 104) 
20 Lib-zoneS: AGCGCTCAGCTGAGCAACTTCNNSNNSNNSNNSNNSNNSNNS 
NNSNNSNNSGCGGCTGATGCATTCCCA (SEQIDNO: 105) 
Lib-linker: GATGGTGAAGCTGCGGCTWCWCWCWCWCWCWC 
WCWCWCWCWCWCWCFATGCATTCCCAACTATACCA (SEQIDNO: 106) 
Pep-ins- ACTTTCAAAAAGTTTCTGAAANWTNKTNWT^mTmmT 
25 nwtnwTNWTNWTNWTNK 
AG (SEQIDNO: 107) 

rinm f . 22 rw^wion of opt imal linker leng th for t hn tti-ph Y of o PH*^ <" S ed to the C 
terminus of pro tein VIII. 

30 ' Standard molecular biology techniques were used to construct a phagemid designated 
pS1290a. P S1290a is identical to phagemid pS349 (see Example 8) except that the open reading 
frame (ORF) under the control of the IPTG-inducible Ptac promoter (New England Biolabs) has 
been deleted and replaced by a new ORF. The new ORF encodes a fusion product consisting of 
the maltose binding protein signal peptide, followed by a Ser residue, followed by residues 2-50 of 

35 matureproteinVmof£.c 0 /ibacteriophageM13. The ORF is followed by two TAA stop codons, 
followed by sequence (CACCATCACCATCACCATGCG) (SEQ ID NO: 108) encoding a 
heptapeptide (HHHHHHA, hexaffis) flag or epitope tag, followed by two stop codons (TGATAA). 
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pS1290a was mutated using the method of Kunkel (Example 12). The two TAA stop 
codons and the first His codon following the protein VIE C-terminus were replaced by various 
numbers of Gly codons. Appropriately designed and named mutagenic oligonucleotides were used 
(e.g., oligonucleotide G-6 inserts six Gly codons). This resulted in the construction of a series of 
5 phagemids encoding ORFs designed to secrete protein VIII molecules with C-terminal fusions 
consisting of linkers containing varying numbers of Gly residues followed by a pentaHis flag 
(HHHHHA). The number of Gly residues was varied from zero (i.e., the polyHis flag was fused 
directly to the protein VITJ C-terminus) to 20. PentaHis flag display was measured by phage 
ELISA (Example 13) with an anti-(His)5 antibody (Qiagen) as the capture target (Fig. 13). 

10 

Example 23 - Optimization of the linker sequence for display of a peptide fused to the C-terminus 
of protein Vffl. 

Libraries were constructed to vary the linker between protein VIII and the hexaHis flag 
encoded by pS 1290a. Libraries were constructed using a modified version of a previously 

15 described method (see Example 10). Mutagenic oligonucleotides were used to replace the two 
TAA stop codons between protein VHI and the hexaHis flag with libraries of linkers. The lengths 
of the linkers were varied and depended on the mutagenic oligonucleotides used: oligonucleotides 
UH-L4 y UH-L5, UH-L6, UH-L8, or UH-L10 introduced linkers containing 4, 5, 6, 8, or 10 residues 
respectively. The total diversity of the linker libraries was 5.7 x 10 10 . 

20 Phage from the linker libraries described above were pooled together and cycled through 

rounds of binding selection with an anti-(His)4 antibody (Qiagen) as the capture target. After two 
rounds of selection, individual clones were assayed for hexaHis flag display using a phage ELISA 
with the anti-(His)4 antibody as target. Clones exhibiting the strongest signals were subjected to 
DNA sequence analysis and the linker sequences were deduced from the DNA sequence and are 

25 shown below. 





GCC 
A 


TGG 
W 


GAG 
E 


GAG 
E 


AAC 
N 


ATC 
I 


GAC 
D 


AGC 
S 


GCC 
A 


CCC 
P 


(SEQ ID NO. 109) 
(SEQIDNO. 1 10) 


30 


CAG 
Q 


TAC 
Y 


GGG 
G 


ACG 
T 


CCG 
P 


GAC 
D 


ACC 

T • 


GAC 
D 


ACC 
T 


GAC 
D 


(SEQ ID NO. Ill) 
(SEQIDNO. 112) 


35 


ACG 
T 


GGG 
G 


TGG 
W 


TTG 
L 


GAG 
E 


GGG 
G 


CCC 

p 


GAC 
D 


ACC 
T 


CCC 
P 


(SEQIDNO. 113) 
(SEQIDNO. 114) 


CTC 
L 


ATG 
M 


GGC 
G 


CCC 
P 


GGC 
G 


GCG 
A 


GAC 
D 


GGC 
G 






(SEQIDNO. 115) 
(SEQIDNO. 116) 


40 


CAC 
H 


GAC 
D 


TCG 
S 


GTC 
V 


CCG 
P 


AGC 
S 


AAC 
N 


GGC 
G 






(SEQIDNO. 117) 
(SEQIDNO. 118) 



Linkers selected for display of a peptide fused to the C-terminus of protein VIII. The 
sequences shown were inserted between the final residue of protein VIE and a heptapeptide 
(HHHHHHA, referred to as a hexaHis flag). For each selectant, the DNA sequence is shown with 
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10 



15 



20 



25 



30 



35 



the deduced amino acid sequence below. The numerical designation for each sequence is shown to 
theleft. 

The level of polyffis flag display achieved with an optimized linker was compared with 
levels of display achieved with poly-glycine linkers of various length (Example 22) using a phage 
ELISA with an anti-(His)5 antibody (Qiagen) as the capture target (Fig. 14). 

Fvam ple 24 - nwrinn and se' "^'"" of " "ew nhare coat protein ! 
polypeptide* as C-termin al fusions. 

This example demonstrates the de novo design of a phage coat protein and the display of a 
protein of interest on the surface of phage particles containing the fusion protein, illustrating the 
broad scope of the method of the invention to prepare any variant phage coat protein. The 
retrotranslation of a peptide is the backward reading of the primary sequence and the resultmg 
peptide is the retro-peptide of the original peptide. For example, the retrotranslation of the pepUde 
Gly-Ala-Leu is the retro-peptide Leu-Ala-Gly. 

Standard molecular biology techniques were used to construct a phagemid designated 
P Sl207a P S1207a is identical to phagemid P S349 (Example 8) except that the QRF under the 
control of the IPTG-inducible Ptac promoter (New England Biolabs) has been deleted and replaced 
by a new ORF. The DNA sequence of the new ORF is as follows: 

ATGAGCAAGAGCACTTTCAAAAAGTTTCTGAAAGAGACTGCTAGCGCTCAGCTGAGC 
AACTTCGCTGCTAAGGCGCCAGACGATGGTGAAGCTGCGGCTCACCATCACCATCACC 

ATGCG (SEQ ID NO: 119) 

The new ORF encodes the following polypeptide: 
MSKSTFKKFLKETASAQLSNFAAKAPDDGEAAAHHHHHHA. (SEQ ID NO: 120). 

This ORF was designed as follows. The first two residues were (Met-Ser) chosen to allow 
good translation initiation. This dipeptide was followed by a retrotranslation of residues 40-18 of 
mature protein Vffl from M13 bacteriophage (KLFKKFTSK retrotranslated to KSTFKKFLK) 
which was in turn followed by a retrotranslation of protein VHI residues 1-20 
(AEGDDPAKAAFNSLQASATE retrotranslated to ETASAQLSNSAAKAPDDGEA). To the C- 
terminus of this polypeptide was fused a nonapeptide (AAHHHHHHA) hexaHis flag. Thus, tins 
ORF consists of the dipeptide Met-Ser, followed by a retrotranslation of residues 1-48 of mature 
protein VIII with the central hydrophobic section (residues 21-30) deleted, followed by a hexaHis 
flag. 

A library of 19-mer peptides was inserted between residues 1 1 and 12 of the above 
described ORF, using a modified version of a previously described method (see Example 10) with 
pS1207a as the template and Pep-ins as the mutagenic oligonucleotide. The resulting hbrary 
encoded ORFs with the following sequences: 

MSKSTFKKFLK-(x) 1 9-ETASAQLSNFAAKAPDDGEAAAHHHHHHA (SEQ ID NO: 121) 
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10 



15 



20 



25 



where "(x)19" indicates a random 19-mer peptide library. The degenerate codons used at each 
position within the library are shown in Figure 15. The library diversity was 8.3 x 10 10 . 

Phage from the library were cycled through rounds of binding selection with an anti-(His)4 
antibody (Qiagen) as the capture target. After three or four rounds of selection, individual clones 
were assayed for hexaHis flag display using a phage ELISA with either the anti-(His)4 antibody or 
bovine serum albumin (BS A) as target. Of 72 clones assayed, six exhibited at least a two-fold 
greater signal when captured with the anti-(His)4 antibody rather than with BSA (Fig. 16). These 
clones were subjected to DNA sequence analysis and the protein sequence was deduced from the 
DNA sequence. 

These protein sequences represent a new class of phage coat proteins which we designate 
"Protein- 12" (P12). The individual unique clones are designated by an additional numeral (eg, 
Protein-12-1 or P12-1). As shown in Figure 16, peptides fused to the C-terminus of P12 are 
displayed on the surface of Ml 3 phage. The phagemid containing the gene encoding PI 2-1 was 
named pS 1230a. 

Example 25 - Selection of a second generation P12 for the display of a large protein as a Cterminal 
fusion . 

The method of Kunkel (Example 12) was used with the mutagenic oligonucleotide add-NX 
to insert an Nsil restriction site followed by an Xbal restriction site into phagemid pS 1230a, 
between the regions encoding PI 2-1 and the polyHis flag. The resulting sequence was as follows: 
. . . gctgcggctG ATGCAT CTGGTAGCG TCTAGA GCcaccatcaccatcaccat. . .(SEQ ID NO: 122) 
The inserted sequence is shown in upper case text with the Nisi sndXbal restriction sites. The 
inserted sequence is preceded by sequence encoding the final residues of PI 2-1 and followed by 
sequence encoding the polyHis flag (both shown in lower case text). The new phagemid was 
designated pS 1232a. 

Phagemid pS 1232a was digested with Nsil and Xbal and a similarly digested DNA 
fragment encoding an hGH variant (hGH supermutant, hGHsm) with improved afffinity for the 
hGH binding protein (hGHbp) was inserted. The phagemid was designated pS 1239b; it contains 
an ORF encoding P12-1 followed by a tetrapeptide linker (Ala- Ala- Asp- Ala), followed by hGHsm 
as shown below. The protein product of the pS 1239b ORF is depicted; it consists of PI 2-1, 
followed by a tetrapeptide linker (AADA), followed by hGHsm. P12-1 was divided into six zones 
as indicated, and a library was constructed for each zone. In addition, a linker library was 
constructed in which random 14-residue peptides were inserted in the middle of the tetrapeptide 
linker as shown. 
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M S K 8 T F K K F L X 

i ry;. - 1 

zone 1 I ^ 

^ zone 2 



10 vFVFSVDVDNNWIWAVGII 

i ; i 

zone 3 I 

zone 4 

15 

E T A S A Q L S _F_A_ f « _ ° A--. | 

I " zone 6 

20 zone 5 



25 A A D A | ---hGHsm 
I 

linker library {SEQ ID N0 . 123) 

(insert 14 WC condons between Ala-Asp) 

30 Phage particles produced from P S1239b did not display hGHsm at levels detectable in a phage 
ELISA with hGHbp as the capture target. 

To obtain a P12 variant capable of displaying hGHsm as a C-terminal fusion, libraries 
were constructed to vary the sequence of PI 2-1 encoded by P S 1239b. For library construction, a 
previously described method was used (Example 10). P12-1 was divided into six zones with each 
35 zone containing a stretch of contiguous residues (zone 1, residues 2 to 7; zone 2, residues 6 to 1 1; 
zone 3, residues 12 to 21; zone 4, residues 21 to 30; zone 5, residues 31 to 40; zone 6, residues 41 
to 50) Oligonucleotides were designed to simultaneously replace all codons within the zone with 
an equal number of degenerate codons (NNS, where N = A, C, G, or T) encoding all twenty natural 
amino acids. Each oligonucleotide was named according to the zone it mutated (e.g. 
40 Oligonucleotide Lib-zonel mutated zone 1). In addition, an oligonucleotide (Lib-linker) was 
designed to insert 1 4 degenerate codons (WC, where V = A, C, or G; encoding Ala, Arg, Asn, 
Asp, Gly.His, Pro, Ser, or Thr)into the middle of thetetrapeptide linker connecting PI 2-1 to ^ 
hGHsm The diversities of these libraries were as follows: zone 1, 2.5 x 10 10 ; zone 2, 2.5 x 10 ; 
zone 3, 2.6 x 10* zone 4, 2.4 x 10»; zone 5, 2.4 x 10'°; zone 6, 2.3 x 10'°; linker library, 2.8 x 

45 10 10 . , . 

Phage from all the libraries were pooled and cycled through rounds of bindmg selection 
withhGHbp (Example 11) coated on 96-wellMaxisorpimmunoplates as a target. Phage were 
proposed in E. coli SS320 cells with M13-VCS helper phage (Stratagene). After four rounds of 
binding selection, individual clones were analyzed for hGHsm display. For each of the rounds 2, 

50 3, and 4, 24 clones were analyzed. Phage were isolated from each clone and hGHsm display was 
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detected using a phage ELISA (Example 1 3). A single clone from round 2 exhibited 10-fold 
greater binding to plates coated with hGHbp in comparison with plates coated with BSA; all other 
clones exhibited similar binding to either hGHbp or BSA coated plates. The phagemid 
corresponding to the positive clone was designated pS1258. 

The complete DNA sequence of the P12-1 variant encoding ORF of pS1258 was 
determined and the protein sequence was deduced and shown below. The amino acid numbering is 
shown to the right. 



ATG AGC AAG AGC ACT TTC AAA AAG TTT CTG 
10MSKSTFKKFL 



AAA GTT TTT GTT TTT TCT GTT GAT GTT GAT 
KVPVFSVDVD 

15 

AAT AAT TGG ATT TGG GCT GTC GGT ATT ATT 
N NWIWAVGII 



20 



TAC ATG CTC CTC GTG GAG GCG TCG CCC TGG 
YMLLVEASPW 



25 GCT GCT AAG GCG CCA GAC GAT GGT GAA GCT 
AAKAPDDGEA 

(SEQ ID NO. 124) 

30 The new variant coat protein was named P12-7; its sequence differs from that of P12-1 in zone 5. 
The fusion of hGHsm to the C-terminus of P12-7 permits the display of hGHsm on the surface of 
M13 phage, as evidenced by a phage ELISA. 

We also wished to demonstrate that PI 2-7 permits the display of other proteins, for 
example wild-type hGH. A phagemid analogous to pS 1239b (described above) was constructed 

35 and designated pS1239a, with the only difference being that pS1239b encodes a fusion protein 
consisting of P12-1 followed by wild-type hGH (Example 8). Phage particles produced from 
pS1239a did not display hGH at levels detectable in a phage ELISA. The method of Kunkel 
(Example 12) was used with a mutagenic oligonucleotide (add-P12-7) to convert the pS1239a 
DNA sequence encoding PI 2-1 to DNA sequence encoding PI 2-7. The new phagemid was 

40 designated pW930a; it contains an ORF encoding a fusion protein consisting of PI 2-7 followed by 
wild-type hGH. Phage particles isolated from E. coli cultures harboring pW930a displayed hGH 
on their surface, as evidenced by a phage ELISA . 



6* 



WO 00/06717 



PCT/US99/16596 



Examolg 26 Y^"^ " f seqi1PTirP<? f ° r ^ ° f * ft aaL teJhB C " termi * US ° f 

the protein TTT C-termin al domain. 

Oligonucleotides for Example 26: 
5 UH-L4: TTCACCTCGAAAGCAAGCNNSNNSNNSNNSCACCAT 

CACCATCACCAT(SEQIDNO: 125) 

UH-L5: TTCACCTCGAAAGCAAGCNNSNNSNNSNNSNNSCACCAT 
CACCATCACCAT(SEQIDNO: 126) 

UH-L6: TTCACCTCGAAAGCAAGCNNSNNSNNSNNSNNSNNSCACCAT 

10 CACCATCACCAT(SEQIDNO:127) 

UH-L8: TTCACCTCGAAAGCAAGCNNSNNSNNSNNSNNSNNSWCWCCACCAT 

CACCATCACCAT(SEQIDNO: 128) 

UH-L10: TTCACCTCGAAAGCAAGCNNSNNSNNSNNSNNSNNSWCWC 
WCWCCACCATCACCATCACCAT (SEQ ID NO: 129) 

15 UHg3-L6- CTGCGTAATAAGGAGTCTNNSNNSNNSNNSNNSNNS 

CACCATCACCATCACCATTAATCATGCCAGTTCTTTTGG (SEQ ID NO: 130) 
UHg3-L8- CTGCGTAATAAGGAGTCTNNSNNSNNSNNSNNSNNS 
NNSNNSCACCATCACCATCACCATTAATCATGCCAGTTCTTTTGG (SEQ ID NO: 131) 
UHg3-L10: CTGCGTAATAAGGAGTCTNNSNNSNNSNNSNNSNNSNNS 

20 NNSNNSNNSCACCATCACCATCACCATTAATCATGCCAGTTCTTTTGG 

(SEQ ID NO: 132) 

Standard molecular biology techniques were used to construct a phagemid designated 
pS 1428d Phagemid P S 1428d is similar to P S1290a, except that the ORF under the control of the 
IPTG-inducible Ptac promoter (New England Biolabs) consists of the maltose binding protem 
signal peptide followed by the C-terminal domain of M13 protein ffl (Lowman et aL (1991) 
Biochemistry, 30:10832). The method of Kunkel (Example 12) was used to fuse libraries to the C 
terminus of the protein ffl C-terminal domain encoded by P S1428d. The libraries consisted of 
random linkers of various lengths followed by a hexaHis flag (HHffifflH). The end result was 
libraries containing ORFs which encoded the C-terminal domain of protein ffl, followed by 
random polypeptide linker sequences, Mowed by the hexaHis flag. The lengths of the hnkers 
were varied and depended on the mutagenic oligonucleotides used: oligonucleotides UHg3-L6, 
UHg3-L8, or UHg3-L10 introduced linkers containing 6, 8, or 10 residues respectively. The 
diversities of the libraries were as follows: UHg3-L6, 3.5 x 10 10 ; UHg3-L8, 1.2 x 10 10 ; UHgS- 



25 



35 L10, 2.8 x 10 10 



Phage from the libraries were pooled together and cycled through binding selection with an 
anti-(His)4 antibody (Qiagen) as the capture target. After two rounds of selection, individual 
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clones were assayed for hexaHis flag display using a phage ELIS A with the anti-(His)4 antibody as 
target. Three clones exhibiting strong signals were subjected to DNA sequence analysis and the 
selected linker sequences are shown below. 



10 



15 



g3-l GGG CAG GCC AGG ATC GTC TAC CGG CAG AAG 
(SEQ ID NO. 133) 

GQAR I VYRQK 

(SEQ ID NO. 134) 



g3-2 AGG ATC AGG GTC CTG CAG AAG GGC AAG GAG 
(SEQ ID NO. 135) 

RIRVLQKGKE 



(SEQ ID NO. 136) 



g3-3 CGC GCC AAG ATC GAG CAG ATC TGC AAG GAG 
(SEQ ID NO 137) 

RAKI EQICKE 



20 (SEQ ID NO. 138) 



The sequences shown were inserted between the final residue of the protein III C-terminal 
domain and a hexaHis flag. For each selectant, the DNA sequence is shown with the deduced 
amino acid sequence below. A designation for each sequence is shown to the left; 

25 The levels of polyHis flag display were compared with levels of display achieved with C- 

terminal or N-terminal fusions to protein VIE. Interestingly, display with C-terminal fusion to the 
protein HI C-terminal domain was equivalent to display with N-terminal fusion to protein VIII and 
about 10-fold higher than display with C-terminal fusion to protein Vm (Fig. 18). 

The foregoing written specification is considered to be sufficient to enable one skilled in 

30 the art to practice the invention. The present invention is not to be limited in scope by the cultures 
deposited, since the deposited embodiments are intended as separate illustrations of certain aspects 
of the invention and any cultures that are functionally equivalent are within the scope of this 
invention. The deposit of material herein does not constitute an admission that the written 
description herein contained is inadequate to enable the practice of any aspect of the invention, 

35 including the best mode thereof, nor is it to be construed as limiting the scope of the claims to the 
specific illustrations that it represents. Indeed, various modifications of the invention in addition to 
those shown and described herein will become apparent to those skilled in the art from the 
foregoing description and fall within the scope of the appended claims. 

While the invention has necessarily been described in conjunction with preferred 

40 embodiments, one of ordinary skill, after reading the foregoing specification, will be able to effect 
various changes, substitutions of equivalents, and alterations to the subject matter set forth herein, 
without departing from the spirit and scope thereof. Hence, the invention can be practiced in ways 
other than those specifically described herein. It is therefore intended that the protection granted 
by Letters Patent hereon be limited only by the appended claims and equivalents thereof. 



WO 00/06717 



PCT/US99/16596 



10 



WHAT IS CLAIMED: 

1 . A fusion protein comprising a heterologous polypeptide fused to a major coat protein of a 
virus, wherein the major coat protein is a variant of a wild type major coat protein of the 
virus. 

2. The fusion protein of claim 1 , wherein the virus is selected from the group consisting of a 
filamentous phage, a lambda phage, a Baculovirus, a T4 phage and a T7 phage. 

3. The fusion protein of claim 1 , wherein the phage is a filamentous phage, the major coat 
protein is gpVffl and the heterologous polypeptide is fused to the N-terminus or the C- 
terminus thereof. 



4. The fusion protein of claim 1, wherein the major coat protein is a filamentous phage coat 
protein variant which contains at least one amino acid residue selected from the list below 
15 in the position indicated: 



20 



25 



30 



35 



Residue Number 


Amino Acid Residue 


1 


E, L, V, Q.D.I.N 


2 


R,H,F,W,E,K,Y,D 


3 


T, E, L, S, D, I, V, A 


4 


D, R, H, E, K 


5 


R, H, N, D, K, Q, E 


6 


Y, W, S,LL,F,T,V 


7 


T,N,S 


8 


D, H, R, E, K 


9 


E,Q,T,D,N,S 


11 


W,I,V,Y,L,F 


12 


R, H, N, E, D, K, Q 


13 


I,L,E,Q,A,V,D,T,N,S 


14 


L.I.V 


15 


D, R, N, E, K, H, Q 


16 


E,V,L,T,D,I,A,S,G 


17 


E,V,L,I,A,T,D 


18 


L.V.I 


19 


L, T, Q, E,I,V,S,A,N,D 


20 


R, D, H, N, Q, K, E 


21 


W,Y,I,L,F,V 


22 


W,F,Y 



Ve 
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23 



W,Y,I, V,H, K, F, L,R 
I, Q, L.N.V 



24 



25 



S, L, I, T, V 
A, I, V, G, L,M 



26 



5 



27 



N, T,S 



28 



I, L, V 

K, R, F, W, H, Y 
I,V, L. 



29 



30 



10 5. 

15 6. 
7. 

20 

8. 
9. 

25 

10. 

30 

11. 
12. 

35 13. 
14. 



A fusion protein comprising a heterologous polypeptide fused to at least a portion of a coat 
protein of a filamentous phage, wherein the coat protein is a variant of a wild type coat 
protein of the phage, the variant having an alteration in the transmembrane domain or in 
the cytoplasmic domain of the coat protein. 

The fusion protein of claim 5, wherein the coat protein is gpm of a filamentous phage and 
the heterologous polypeptide is fused to the N-terminus or the C-terminus thereof. 

The fusion protein of claim 1, wherein the variant has 2-50 altered residues relative to the 
wild type coat protein sequence. 

The fusion protein of claim 1, wherein the heterologous polypeptide is an antibody or a 
fragment thereof or a cytokine or a cytokine receptor. 

A replicable expression vector comprising a gene fusion, wherein the gene fusion encodes 
the fusion protein of claim 1 . 

A library comprising a plurality of the replicable expression vectors of claim 9, the 
expression vectors comprising a plurality of different gene fusions encoding a plurality of 
fusion proteins. 

Host cells comprising the vector of claim 20. 

A virus displaying the fusion protein of claim 1 on the surface thereof. 

A library of virus, comprising a plurality of the virus of claim 12 displaying a plurality of 
different fusion proteins on the surface thereof. 

A method, comprising: 
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constructing a library of phage or phagemid particles displaying a plurality of the fusion 
protein of claim 1; 

contacting the phage or phagemid particles with a target molecule so that at least a portion 
of the particles bind to the target molecule; and 
5 separating the particles that bind from those that do not bind. 

15. A method of decreasing the detection limit of a phage display system utilizing a phage 
containing a gene fusion encoding a fusion protein, where the gene fusion comprises a first gene 
encoding a heterologous polypeptide and a second gene encoding at least a portion of a phage coat 
1 0 protein, the method comprising mutating the second gene to encode a variant of a wild type coat 
protein of the phage. 

16 A method of transforming cells, comprising electroporating cells in the presence of 
heterologous DNA under conditions suitable to allow the heterologous DNA to enter the cells, 
15 wherein the heterologous DNA is purified by affinity purification. 

17. The method of claim 16, wherein the heterologous DNA is present at a concentration of 
about 1 picogram to about 500 micrograms/mL. 

20 18. The method of claim 16, obtaining at least 1 x 10 10 transformants in one electroporating 
step. 

19. The method of claim 16, wherein the cells are present at a concentration of about 1 x 10" 
to about 4 x 10" cfu/mL. 

25 

20. The method of claim 19, wherein the cells are F::Tnl0 P roA + B + lacIW(lacZ)M15f 
F .araD139D{ a ra-leu)7696galE15galKJ6DVacVC74n>s«^^ R C ° H 
cells. 

A method for producing a product polypeptide, comprising the steps of: 
(1) culturing a host cell transformed with a replicable expression vector, the replicable 
expression vector comprising DNA encoding a product polypeptide operably linked to a 
control sequence capable of effecting expression of the product polypeptide in the host 
cell; 

wherein the DNA encoding the product polypeptide has been obtained by a method 
comprising the steps of: 



30 21. 



35 
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35 



16; 



(a) constructing a family of variant replicable plasmids comprising a transcription 
regulatory element operably linked to a gene fusion encoding a fusion protein, wherein the 
gene fusion comprises a first gene encoding a polypeptide and a second gene encoding at 
least a portion of a phage coat protein, wherein the variant replicable plasmids comprise 
variant first genes encoding variant polypeptides; 

(b) transforming suitable host cells with the plasmids using the method of claim 



(c) optionally, when the plasmid is a phagemid which requires a helper phage to 
produce phage particles, infecting the transformed host cells with an amount of helper 

10 phage encoding the phage coat protein sufficient to produce recombinant phagemid 

particles, preferably wherein no more than a minor amount of the phagemid particles 
display one or more copies of the fusion protein on the surface of the phagemid particles; 

(d) culturing the transformed infected host cells under conditions suitable for 
forming recombinant phage particles containing at least a portion of the plasmid and 

1 5 capable of transforming the host cells; 

(e) contacting the recombinant phage particles with a target molecule so that at 
least a portion of the phage particles bind to the target molecule; 

(f) separating phage particles that bind to the target molecule from those that do 
not bind; 

20 (g) selecting one of the variant polypeptides encoded by the plasmid in a phage 

particle which binds or does not bind to the target molecule as the product polypeptide and 
cloning DNA encoding the product polypeptide into the replicable expression vector; and 



(2) recovering the expressed product polypeptide. 

22. A fusion protein comprising at least a portion of a protein III or protein VIII filamentous 
phage coat protein having a heterologous polypeptide fused to the carboxyl-terminus thereof. 

23. A replicable expression vector comprising a gene fusion, wherein the gene fusion encodes 
30 the fusion protein of claim 22. 

24. A library comprising a plurality of the replicable expression vectors of claim 23, the 
expression vectors comprising a plurality of different gene fusions encoding a plurality of fusion 
proteins. 



25. Host cells comprising the vector of claim 23. 

26. A virus displaying the fusion protein of claim 22 on the surface thereof. 



claim. 
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27. A library of vims, comprising a plurality of the virus of claim 26 displaying a plurality of 
different fusion proteins on the surface thereof. 



5 28 . A method, comprising: 

constructing a library of phage or phagemid particles displaying a plurality of the fusion 

protein of claim 22 on the surface thereof; 

contacting the phage or phagemid particles with a target molecule so that at least a portion 
of the particles bind to the target molecule; and 
1 o separating the particles that bind from those that do not bind. 
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Figure 4B 
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Figure 12C 
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(57) Abstract 



The invention relates to particles comprising: a) a protein membrane with a fusion protein which comprises a virus protein, a 
cell-permeability-mediating peptide and a heterologous cell-specific binding site; and b) a nucleic acid which is contained in the protein 
membrane and presents sequences for a virus-specific packaging signal and a structural gene. The invention also relates to methods for 
producing such particles, means suitable for this purpose and the use of the particles in gene therapy. 

(57) Zusammenfassung 

Die vorliegende Erfindung betrifft Partikel, umfassend: (a) eine Proteinhulle mit einem Fusionsprotein, das ein Virus-Protein, 
ein Zellpermeabilitat-vermittelndes Peptid und eine heterologe zellspezifische Bindungsstelle umfasst, und (b) eine in der ProteinhUlle 
vorliegende Nukleinsaure, die Sequenzen fllr ein Virus-spezifisches Verpackungssignal und ein Struktur-Gen aufweist. Femer betrifft die 
Erfindung Verfahren zur Herstellung solcher Partikel und hierfur geeignete Mittel sowie die Verwendung der Partikel zur Gentherapie. 
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Part ike 1 zur Qentherapie 

Die vorliegende Erf inciting betrifft Nukleinsaure enthaltende 
Partikel, die gezielt an Zellen binden und in diese ihre 
5 Nukleinsaure einfuhren konnen. Ferner betrifft die Erfindung 
Verfahren zur Herstellung solcher Partikel und hierfur 
geeignete Mittel sowie die Verwendung der Partikel zur 
Gen therapie. 

10 Fur eine Gentherapie ist es wichtig ein Gentransfer system zu 
haben, das spezifisch ist, d.h. mit dem gewunschte Zellen 
erreicht und in diese Gene eingefuhrt werden konnen. Bei 
Leberzellen kann dies prinzipiell mit einem modif izierten 
Hepatitis B-Virus (HBV) als Vektor moglich, da HBV 

15 leberzellspezif isch ist. Fur andere Zellen, z.B. Fibroblasten, 
existiert jedoch kein Gentransfer system, das bef riedigende 
Ergebnisse liefert. 

Der vorliegenden Erfindung liegt somit die Aufgabe zugrunde, 
20 ein Gentransf ersystem bereitzustellen, das spezifisch ist, 
d.h. mit dem gewunschte Zellen erreicht und in diese Gene 
eingefuhrt werden konnen. 

Erf indungsgemaS wird dies durch die Gegenstande in den 
25 Patentanspriichen erreicht. 

Die vorliegende Erfindung beruht auf den Erkenntnissen des 
Anmelders, dag Nukleinsaure enthaltende Partikel, die ein 
Fusionsprotein aufweisen, welches ein Virus-Protein, ein 

30 Zellpermeabilitat-vermittelndes Peptid, insbesondere ein 
solches der deutschen Patentanmeldung 198 50 718.6, und eine 
heterologe zellspezif ische Bindungss telle umfafit, an 
entsprechende Zellen binden und in diese ihre Nukleinsaure 
einfuhren konnen. Beispielsweise hat er Nukleinsaure 

35 enthaltende HBV- Partikel hergestellt, die an Fibroblasten 
binden und in diese ihre Nukleinsaure einfuhren. Hierzu hat er 
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di e Hepatocyten-Bindungsstelle die. in der Region ^ 
insbesondere zwischen den Ammosauren 21-47, des g 
Oberflaohenproteins von HBV (LHBs) vorliegt, gegen dxe C.SG1 
Inlgrin-Bindungsstelle von Pibronektin ausgetauseht, wober 
» luper^eabUitat-vern.ittelnde Peptid, das in der 
PreS2 von LHBs vorliegt, erhalten blieb. Feme hat er 
plrtikel -It Fibroblasten-spezifitat hergestellt. rnde* er das 
core-Protein von HBV (HBcAg) .it der aSfcl-Integrrn-Brndungs- 
stelle von Fibronektin und de* vorstehenden Zellpermeabrlrtat- 
vemittelnden Peptid verbunden hat. Desweiteren hat er 
eZnnt, daS die in den Partikeln enthaltene Huklemsaure u> 
den Zellen exprimiert wird. 

Erf indungsgen^ warden die ErKenntnisse des welders genutzt, 
Partikel bereitzustellen, umfassend: 



em 



(a) eine Proteinhttlle mit einexa Fusionsprotein, das 
{) I rus-Protein, ein zellper.eabilitat-ver.ittelndes Pep txd 
U nd eine heterologe zellspezif ische Bindungsstelle 

umfafit, und , 
Ib) eine in der Proteinhttlle vorliegende Nuklernsaure dxe 
Seguenzen £ttr ein virus-spezirisches verpackttngssrgnal 
und ein Struktur-Gen aufweist. 

ner Ausdruck -zellpermeabilitat-vermittelndes Peptid- umfaSt 
•egnl Peptide, die substanzen. insbesondere Proteinen. erne 
^ elabUitat ver^itteln kttnnen. Pies — insbesondere 
jen e Peptide, die in - detttschen Patent^ ^g 1* * 
718 6 des Anmelders genannt sind. Besonders d y 
Peptid, das die folgende Aminosaure- (DNA) -Sequenz umfafct: 

?LS3I FSRIGC)? 
CCC ATA ICG TCA ATC T7C TCG AGG ATT GGG GAC CCi 



per Ausdruck -zellspezifisohe Bindungsstelle- umfa&t :eg iche 
"Lungsstellen von Proteinen und sonstigen kleinen Molekulen 
uler welche die Protelne bzw. die Molekttle an sellen brnden 
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konnen. Beispiele solcher Bindungsstellen finden sich in 
Cytokinen und Wachstumsf aktoren. Ferner finden sie sich in 
Liganden von Hormon-, Neurotransmitter-, Blutzellenober- 
flachen- und Int egr in-Rezeptoren . Eine bevorzugte 
5 Bindungsstelle ist die Ct5£l-Integrin-Bindungsstelle von Fibro- 
nektin. Diese wird nachstehend mit RGD bezeichnet und umfaSt 
die Aminosauren Arginin, Glycin und Aspartat. 

Der Ausdruck "Virus" umfaSt DNA- und RNA-Viren, insbesqndere 
10 Adenoviren, Adeno-assoziierte Viren, Vacciniaviren, 
Baculoviren, Hepatitis C-Viren, Hepatitis A-Viren, 
Inf luenzaviren und Hepadna- Viren. Beispiele letzterer sind 
HBV, WHV ("woodchuck hepatitis virus") GSHV ("ground squirrel 
hepatitis virus"), RBSHV ("red-bellied squirrel hepatitis 
15 virus") DHV ("Pekin duck hepatitis virus") und HHV ("heron 
hepatitis virus"), wobei HBV bevorzugt ist. 

Der Ausdruck "Virus-Protein" betrifft jegliches Protein eines 
vorstehenden Virus, das als ganzes oder teilweise in einem 

20 Fusionsprotein zusammen mit einem Zellpermeabilitat-vermit- 
telnden Peptid und einer heterologen zellspezif ischen Bin- 
dungsstelle in Form eines weiteren Peptids vorliegen kann. Das 
Protein kann auch bereits das Zellpermeabilitat-vermittelnde 
Peptid enthalten. Ein Beispiel fvir ein solches Protein ist 

25 LHBs. Dieses wird wie andere Oberf lachen-Proteine und Core- 
Proteine, z.B. HBcAg , bevorzugt. Der Ausdruck "heterolog" 
weist darauf hin, dafi das Protein von Natur aus nicht die 
vorstehende zellspezif ische Bindungsstelle auf weist. Giinstig 
kann es sein, wenn die homologe, d.h. von Natur aus 

30 vorliegende Bindungsstelle des Proteins ausgeschaltet ist. 

Besonders giinstig kann es sein, wenn die homologe durch die 
heterologe Bindungsstelle ersetzt ist. 

Der Ausdruck "Nukleinseiure" umfaSt RNA und DNA, wobei beide 
35 einzelstr£ngig und/oder doppelstrangig sein konnen. 



Der Ausdruck "Virus-spezif isches Verpackungs signal " weist auf 
eine Signalsequenz in vorstehender Nukleinsaure hin, mittels 
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di eser die Kukleinsaure in die Proteinhulle eines Parcels 
veroackt wird. Die Signalsequenz ist spezifisch fur em 
I Z tenendls vims, sine bevorzugte Signalseguenz ist iene von 
HBV. Diese £indet siob an£ der HBV-DHA und „rd u der 
Literatur mit Epsilon bezeichnet. 

per Ausdruck -Struktur-Gen- umjafct Gene, die far peptide 
"proteine, kodieren. Beispiele von Polypeptxden sxnd 
^ornekrosefaktor, Interferon, mterieukine . «££Z 
Wachstumsfaktoren. Plaszaproteine, z.B. Gerxnnungsf aktoren und 
"bseienzy^e. und Pezeptoren. xnsbesondere k^en d e 
Polypeptide aolche sein. welche die I^unogenxtat von zellen 
stellern Kdnnen. Dies kdnnen Polypeptide sein. die Tu.orzellen 
£ eh\r . B. cytokine, wie IL-2 und GM-CSF , und 
fehlen, z.a. y .... B7 ., tumor-assoziierte 

^TTrZn Z nase"n le und "i-le Polypeptide. ,B. 
r i :r'ranerPapUlo,vi r us und -,-3-Polypeptid von 
Uin-B—Virus. Perner k*nnen die Polypeptx e = 
« ^„r, c »r^^■irte oligomerisierungsmotive ernes foxyp p 
llZT^^e von Virus-Hlllpolypeptiden und _ 
self Femer^mfaSt der Ausdruck "Struktur-Gen" Antxsense- 
Xnu^eotide, Peptid-NukleinsMuren, Consensus-Seguenzen fur 
Transkriptionsf aktoren und Ribozyme. 

«v-/i<=,« Partikel bevorzugt, die ein 
Erfindungege^S FragMnte aavon 

Fusi onsprotexn - tha ^J" telle insb esondere RGD, u*faSt. 
und eine heterologe Bindungsstene, xns 

und erne n he terologe Bindungsstelle , 

Gunstxg rst es, wenn Bindungss telle 

— — . -. = = — r 

auf weist . 

n es weiteren werden Partikel bevorzugt. die ein Pusionsprotein 
Des wezteren Zell permeabilitat-vermittelndes 

^IH'b ein soxl^'aer deutscben Patentan.eldung ». 50 
* insbesondere mit der vorstebend angegebenen ^inos.u- 
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resequenz, und eine heterologe Bindungsstelle, insbesondere 
RGD, umfafit. Besonders bevorzugt ist es, wenn das 
Fusionsprotein die Aminosauresequenz von Fig. 2 oder eine 
hiervon durch eine oder raehrere Aminosauren unterschiedliche 
5 Aminosauresequenz aufweist. 

Der Ausdruck "eine durch eine oder mehrere Aminosauren 
unterschiedliche Aminosauresequenz" weist darauf hin, daS 
diese Aminosauresequenz ein Fusionsprotein kennzeichnet , das 
10 vergleichbare Elemente und Funktionen wie das Fusionsprotein 
von Fig. 1 oder Fig. 2 aufweist, sich allerdings bis zu 20 %, 
vorzugsweise 10 %, von der Aminosauresequenz von Fig. 1 oder 2 
unterscheidet . 

15 Ein erf indungsgemaSes Partikel kann durch ubliche Verfahren 
hergestellt werden. Enthalt das Partikel z.B. ein 
Fusionsprotein, das ein LHBs umfaSt, in dem die homologe durch 
eine heterologe Bindungsstelle, insbesondere RGD, ersetzt ist, 
so ist ein Verfahren giinstig, das folgende Verf ahrensschritte 

20 aufweist: 

(a) Co-Transf ektion von Zellen, die fur ein Heptatitis 
B-Virus-Genom kodieren, wobei diese kein LHBs 
exprimieren, mit einem ersten Express ionsvektor, der 

25 fur ein Fusionsprotein kodiert, das ein LHBs umfaSt, 

in dem die homologe durch eine heterologe 
Bindungsstelle, insbesondere RGD, ersetzt ist, und 
mit einem zweiten Expressionsvektor, der ein Virus- 
spezifisches Verpackungs signal und ein Struktur-Gen 

30 aufweist, und 

(b) Isolierung und Reinigung des Partikels. 

Enth&lt das Partikel ein Fusionsprotein, das ein HBcAG, ein 
35 Zellpermeabilitat-vermittelndes Peptid gem£& der deutschen 
Patentanmeldung 198 50718.6, insbesondere jenes mit 
vorstehender Aminosauresequenz, und eine heterologe 
Bindungsstelle, insbesondere RGD, umfafct, so ist ein Verfahren 
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gunstig, das folgende Verf ahrensschritte umfaSt: 

(a) co-Transfektion von Zellen, die fur eine HBV- 
Polymerase kodieren, rait einem ersten 

a-t fflr ein Fusionsprotein 
Express ionsvektor, der fur em 
kodiert, das ein HBcAg, ein Zellpermeabxlitat 
vermittelndes Peptid ge^S der deutschen Patentan- 
meldung 198 50718.6, insbesondere Denes nut 
: LteLder Aminosaureseguenz, und eine heterology 
Bindungsstelle, insbesondere ROD, umfafct, ^ 
einem zweiten Expres s ionsvektor , der em Vxrus 
^ifiscbes verpackungssignal und ein Struktur-Gen 
aufweist, und 
(b) isolierung und Reinigung des Partikels. 
«• ichtlich der Ausdracke "Expressionsvektor" , "Zellen", und 

ebenfalls einen o y vorstehende 
Bezuglich der anderen Ausdrucke w.rd auf 

Ausfiihrungen verwiesen. 

>prer oegenstand der vorliegenden Erfindung 1st ein 

"ete^n dnrch ein ode, — n 

unterschiedliche Amino sauresequenz . 

Fusionsprotein. das «n HBcAg, * Bin a u n g sstelle. 
vermittelndes Pept^d und eine heterol °* ^ 

opn nmfafit vorzugsweise umfafct aas 
insbesondere RGD, umfafit. vo g 

protein die Aminosauresequenz von Fig 2 ° e 
durch eine oder mehrere Aminosauren untersch 
Aminosauresequenz . 
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Hinsichtlich des Ausdrucks "eine durch eine oder mehrere 
Aminosauren unterschiedliche Aminosauresequenz" wird auf 
vorstehende Ausfiihrungen verwiesen. 

Ein weiterer Gegenstand der vorliegenden Erfindung ist eine 
Nukleinsaure, die fur ein vorstehendes Fusionsprotein kodiert. 
Die Nukleinsaure kann eine RNA oder eine DNA sein. Bevorzugt 
ist eine DNA, die folgendes umfaSt: 

(a) Die DNA von Fig. 1 oder 2 oder eine hiervon durch ein 
oder mehrere Basenpaare unterschiedliche DNA, oder 

(b) eine mit der DNA von (a) uber den degenerierten 
genet ischen Code verwandte DNA. 

Der Ausdruck "eine durch ein oder mehrere Basenpaare 
unterschiedliche DNA" 

weist darauf hin, daS diese DNA fur ein Fusionsprotein 
kodiert, das vorgleichbare Elemente und Funktionen wie das 
Fusionsprotein von Fig. 1 oder 2 aufweist, sich allerdings von 
der Basensequenz von Fig. 1 oder 2 derart unterscheidet , daS 
in der Aminosauresequenz ein Unterschied von maximal 2 0 %, 
vorzugsweise 10 %, vorliegt. 

Eine erf indungsgem&fce DNA kann als solche oder in einem Vektor 
vorliegen. Insbesondere kann eine erf indungsgemaSe DNA in 
einem Express ionsvektor vorliegen. Beispiele solcher sind dem 
Fachmann bekannt. Im Falle eines Expressionsvektors fur E. 
coli sind dies z.B. pGEMEX, pUC-Derivate, pGEX-2T, pET3b und 
pQE-8. Fur die Expression in Hefe sind z.B. pYlOO und Ycpadl 
zu nennen, wahrend fiir die Expression in tierischen Zellen 
z.B. pKCR, pEFBOS, cDM8 , pCEV4 , pCDNA3 , pKSVIO, pRCMV und pRK5 
anzugeben sind. Fiir die Expression in Insektenzellen eignet 
sich besonders der Bacculovirus-Expressionsvektor pAcSGHisNT- 
A. 

Der Fachmann kennt geeignete Zellen, urn die erf indungsgemafie, 
in einem Expressionsvektor vorliegende DNA zu exprimieren. 
Beispiele solcher Zellen umfassen die E . coli-StSmme HB101, 
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DH1 X1776, JH101. » 109. BL21. S» .13009 und 
Dtu, I i \j i fiprischen zellen L, 

Mf-St— saccharoses cerevrsrae, dr die 
OTH 3T3, FM3A, CHO, COS, Vero, HeLa. Hep62, CCL13 
XnseKtenzellen S£9 und am und die Pf lanzenzellen Luprnus 



albus . 



per EachNann Kennt verfahren und Bedingungen Zellen *it eine* 
Z er £ indungsge.aSe » enrnalrenden r 

v^w trans f izieren und die Zelien 
tr .n.for»x.ren b - £^ ^ ^ ^ 

kultivxeren. Auch sxnd mm virus _p rote in zu isolieren 

erfindungsgemase DNA exprxmxerte Vxrus Protexn 

und zu reinigen. 

Kin weiterer Gegenstand der vorliegenden Erfindung ist ein 

»i vorsteLndes Pusionsprotein gerichteter AnUkorper. 
geg en ern vor St ehen ^ ver£ahren 

Kaninchen oder Hunner Fusionsprotein zu 

ir^unxsxeren. Wextere . Boo ale ^orper kann 

dem Fusionsprotexn erf olge n . Der j y ^ ^ 

dann aus dem Serura bzw. Exgelb der Txere erh 
den monoklonalen Antikorper werden Mxlzzellen 
Myelomzellen fusioniert. 

El „ weiterer Gegenstand der ^^J^^ZrZ 
Kit. Ein solcher umfaEt eine oder mehrere 

Komponenten : 

(a) ein erf indungsgemafces Fusionsprotein, 

(b) eine erf indungsgemaSe DNA, 
einen erf indungsgemaSen Antikorper, sowxe 
Cliche Hilfsstoffe, wie Tr.ger, Puffer, LSsungsmxttel, 



(O 
(d) 



Kontrollen, etc. 
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auf vorstehende Ausfiihrungen verwiesen. 
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Die vorliegende Erfindung stellt ein Gentransf er-System 
bereit, das spezifisch ist, d.h. mit dem gewunschte Zellen 
5 erreicht und in diese Gene eingefiihrt werden konnen. Die 
Zellen konnen einzeln oder in einem Gewebe vorliegen. Ferner 
konnen die Zellen isoliert oder im Korper eines Individuums 
vorliegen. Somit eignet sich die vorliegende Erfindung fur 
eine ex vivo bzw. in vivo Gentherapie von Zellen bzw. Geweben. 
10 Die Anwendung der vorliegenden Erfindung kann dabei durch 
erf indungsgemafee Antikorper uberwacht und gesteuert werden. 

Somit stellt die vorliegende Erfindung einen grofien Schritt 
dar gentherapeutische Veranderungen an Zellen bzw. Geweben 
15 gezielt durchzufiihren. 



Kurze Beschreibung der Zeichnungen. 

20 Fig. 1 zeigt die Aminosaure- und DNA-Sequenzen eines 

erf indungsgemafien Fusionsproteins, das ein LHBs und 
die heterologe Bindungsstelle RGD umfaSt, wobei 
diese die homolge ersetzt. 

25 Fig. 2 zeigt die Aminosaure- und DNA-Sequenzen eines 

erf indungsgemafien Fusionsproteins, das ein HBcAg , 
ein Zellpermeabilitat-vermittelndes Peptid der 
vorstehenden Aminosauresequenz und die heterologe 
Bindungsstelle RGD umfaSt. 

30 

Die vorliegende Erfindung wird durch die nachstehenden 
Beispiele erl&utert. 



35 
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Beispiel l: Herstellung eines erfiindungsgeaafien Partikels, 
86 P daS ein Fusionsprotein enthalt, welches exn 

LHBs und eine heterologe Bindungsstelle umfafit. 



20 



25 



(A) 



Herstellung eines Expressionsvektor, der fiir 
samtliche HBV-spezif ischen Proteine mit Ausnahme von 



LHBs kodiert. 



10 K^pien Ls HBV-Genoms. ~ « J* - 

Kopxe) amplxfxzxert GAT ATQ CAG CAC 

fnlnende Sequenz auf : CCA iai iv^j. ±w 
« ggg GC E L EcoRV-Schnittstelle ist unterstrxchen. Das 
15 GGG GC. wne 9fU9 _ 2 852 ersetzt das ATG-Startcodon 

Triplet* ACG zwxschen nt 2849 2852 ers 
, m « t HBs Der backward Primer (nt 2877-2845) weist 
von LHBS - ° er GTQ QAA GATVATC TGC CCC GTG CTG. Exne 

^LhnittTtelTe xst un terstrichen. Das Triplett CGT 

EC ° RV : SC Tt 2852 2849 ersetzt das naturliche Triplett CAT. 
zwxschen nt 2852 2849 verda ut und ttber ein 

Erhaltene PCR-Fragmente werden mxt EcoRV ver 

i%iaes Agarosegel gerexnxgt. Exn Fragment 

P T 3 3 4 «ird aus den, Gel eluiert ued su t be„ahrt. 

r,rnRe von ca. 3 , J kd whu 



Tn einer zwei ten PCR werden ein forward Primer, der eine 
in exner zwexte nac hstehenden Seguenz 

EcoRV-Schnittstelle gefolgt von der na CA 

30 nachstehenden Seguenz ntHBV 2830 l Erha itende PCR- 

ATC TTG TTC CCA AGA ATA TGG) auf wexst . Erhaitende P 

TTG ti verd aut und liber ein praparatxves 1 

Fragmente werden mxt EcoRV - rd erwarteten GrdSe 

* Arr^-rose-Gel geremigt. Em Fragment 

17au deToel eluiert und dephosphoryliert. Dieses Fra^eet 
wxrd aus aem o gro 6en in eine Lxgasere- 

akt ion emgesetzt, »° durch wression5Vett or kodiert 
pTKTHBV2Ldef erhalten wird. Dieser Expressio 

samtUche HBV-spesi £ isehen Preteine mt Ausnatoe ven LHBs. 
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(B) Herstellung eines Expressionsvektors, der ftir ein 
Fusionsprotein kodiert, welches ein LHBs und die 
heterologe Bindungsstelle RGD umfaSt. 

5 

Ausgehend von dem Plasmid pTKTHBV2 (vgl. vorstehend) wird das 
Fragment ntHBV2990-834 durch PCR amplif iziert . Der 5* -Primer 
weist folgende Sequenz auf: AAA AGA TCT GGC CGT GGC GAA GGA 
GCT GGA GCA TTC. Diese umfaSt eine Bglll-Schnittstelle, 

10 gefolgt von einem ATG-Startcodon und der fur das Tripeptid 
RGD-kodierenden Sequenz. Der PreSl-spezif ische Leserahmen wird 
genutzt. Der 3 1 -Primer wiest die folgende Sequenz auf: AAA AGA 
TCT GGT TTA AAT GTA TAC CCA AAG. Diese umfaSt eine Bglll- 
Schnittstelle . Erhaltene PCR-Fragmente werden mit Bglll 

15 verdaut und in den mit Bglll-gespaltenen und 
dephosphorylierten Vektor pCDNA.3 (Invitrogen) inseriert, 
wodurch der Express ions vektor pCRGDLHBs erhalten wird. Dieser 
Express ions vektor kodiert fur ein N-terminal verkurztes LHBs, 
das die RGD-Bindungsstelle umfaSt. 

20 

(C) Herstellung eines ein Struktur-Gen und ein 
Verpackungs signal aufweisenden Expressionsvektors 

Es wird eine fur das HBV-Verpackungssignal Epsilon kodierende 
25 Sequenz, z.B. ntHBV 1840-1914, mittels PCR amplif iziert . Durch 
die verwendeten Primer wird eine EcoRV-Schni tts telle 
eingeftthrt. Die Sequenz des forward Primers lautet : CCC GAT 
ATC ATG TCA TCT CTT GTT CAT GTC CTA. Die Sequenz des backward 
Primers lautet: GGG GAT ATC GGT CGA TGT CCA TGC CCC AAA. 
30 Erhaltene PCR-Fragmente werden mit EcoRV gespalten und in den 
mit EcoRV gespaltenen und dephosphorylierten Vektor pCDNA.3 
(vgl. vorstehend) inseriert,. wodurch der Vektor pcVPHBV 
erhalten wird. Dieser Vektor enth&lt das HBV-spezif ische 
Verpackungssignal Epsilon. 

35 

Ausgehend von dem Vektor pCeGFP (Invitrogen), der fur ein 
"green fluorescent protein" unter der Kontrolle des CMV- 
Promotors kodiert, wird die den CMV-Promotor und das GFP-Gen 
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enthaltende Sequenz mittels PCR amplif iziert . Der forward 
Primer hat folgende Sequenz: GGG GGA TCC CGA TGT ACG GGC CAG 
ATA TAC GCG TTG . Der backward Primer hat folgende Sequenz: GGG 
GGA TCC GCG GCC GCT TTA CTT GTA. Die verwendeten Primer 
enthalten jeweils eine BamHI-Schnittstelle. Erhaltene PCR- 
Fragment e werden mit BamHI gespalten und in den mit BamHI 
gespalt enen und dephosphorylierten Vektor pCVPHBV (Invitrogen) 
inseriert, wodurch der Express ionsvektor pCVPHBVeGFP erhalten 
wird. Dieser Expressionsvektor en thai t das HBV-spezif ische 
Verpackungssignal Epsilon, den CMV- Promoter und eine fur eGFP 
kodierende Sequenz. 

(D) Herstellung einer Verpackungszellinie 

Etwa 0.8xl0 6 HepG2-Zellen werden mit Afig von pTKTHBV2 Lde f (vgl. 

(A) ) und 2 fxg pCDNA.3 (vgl. (B) ) mittels Lipofektion 
transf iziert. pCDNA.3 kodiert fur G418 Resistenz. 2h nach 
Transfektion werden die Zellen in ein 700 mg G418/1 
enthaltendes Medium uberfiihrt. G418 resistende Klone werden 
nach 14d subkultiviert . Die stabile Integration von 
pTKTHBV2 Lde f wird mittels PCR und Southern Blots nachgewiesen. 
Die Expression des Oberf lachenproteins SHBs von HBV und von 
HBcAg wird mittels spezifischer Antikorper in ELISAS 
nachgewiesen. Es wird die Verpackungszellinie HepG2- 
TKTHBV2 Lde f erhalten. Diese Zellinie exprimiert samtliche HBV- 
spezif ischen Proteine mit Ausnahme von LHBs . 

(E) Herstellung erf indungsgem&fcer Partikel 

Etwa 0.8xl0 6 Zellen der Verpackungszellinie von (D) werden mit 
3/ig von pCRGDLHBs (vgl. (B) ) und 3jig von pCVPHBVeGFP (vgl. 

(B) ) mittels Lipofektion transf iziert . 72 h nach Transfektion 
weren die Zellen bzw. deren Uberstande gesammelt und einer 
PEG-Fallung unterworfen. AnschlieSend wird eine CsCl- 
Dichtegradienten-Zentrifugation durchgef ahrt . Es werden 
erfindungsgemaSe Partikel in reiner Form erhalten. Diese 
Partikel umfassen samtliche HBV-spezif ischen Proteine mit 
Ausnahme von LHBs, das durch ein RGD-LHBs ersetzt ist. 
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Beispiel 2: Herstellung eines erf indungsgem&fien Partikels, 
das ein Fusionsprotein enthait, welches ein 
HBcAg, ein Zellperxneabilitat-vermittelndes 
5 Peptid und eine heterologe Bindungsstelle 

umfaSt . 

Es wird eine fur ein Zellpermeabilitat-vermittelndes Peptid 
(nachstehend mit ZPP bezeichnet) kodierende DNA verwendet. 

10 Diese hat folgende Sequenz: XXX AGA TCT ATG CCC ATA TCG TCA 
ATC TTC TCG AGG ATT GGG GAC CCT GGA TCC XXX (X bezeichnet ein 
beliebiges Nukleotid) . Diese weist am 5 1 -Ende eine Bglll- 
Schnittstelle, gefolgt von einem ATG-Startcodon und an ihrem 
3 1 -Ende eine BamHI-Schnittstelle auf. Ein doppelstrangiges 

15 DNA-Molekiil, das auf vorstehender Sequenz basiert, wird mit 
BamHI/Bglll geschnitten und in den mit BamHI-gespaltenen und 
dephosphorylierten Expressionsvektor pCDNA.3 (vgl. vorstehend) 
inseriert, wodurch der Expressionsvektor pCZPP erhalten wird. 

20 Ferner wird der Expressionsvektor pTKTHBV2 (vgl. vorstehend) 
verwendet, urn das Fragment nt-HBV 1861-213 6 mittels PCR zu 
amplif izieren. Der forward- Primer umfaSt die folgende Sequenz: 
XXX GGA TCC ACT GTT CAA GCC TCC AAG CTG. Diese umfaSt eine 
BamHI-Schnittstelle gefolgt von der Sequenz ntHB 1861-1881. 

25 Der backward Primer umfaSt die folgende Sequenz: XXX GAA TTC 
TGG ATC TTC CAA ATT AAC ACC CAC CCA. Diese umfaSt eine EcoRI 
Schnittstelle gefolgt von der Sequenz ntHBV 2139-2116. In 
einer zweiten PCR wird das Fragment ntHBV 2140-2480 
amplif iziert, das an seinem 5 ' -Ende mit der fur das RGD-Motiv 

30 kodierenden Sequenz erweitert ist. Der forward Primer umfaSt 
die folgende Sequenz: XXX GAA TTC CGA GGC GAC GCG TCT AGA GAC 
CTA GTA GTC . Diese umfaSt eine EcoRI-Schnittstelle gefolgt von 
der fiir das RGD-Motiv kodierenden Sequenz und der Sequenz 
ntHBV2140-2161 . Der backward Primer umfaSt die folgende 

35 Sequenz: XXX AAG CTT TCC CCA CCT TAT GAG TCC AAG. Diese umfaSt 

eine Hindlll-Schnittstelle und die Sequenz ntHBV 2480-2460. 

Erhaltene Fragmente beider PCRs werden mit EcoRI gespalten und 
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miteinander ligiert. Das Ligationsprpdukt wird als Template 
fur eine weitere PCR verwendet, wobei als forward Primer jener 
der ersten PCR und als backward Primer jener der zweiten PCR 
verwendet werden. Erhaltene PCR-Fragmente werden mit BamHI/- 
5 Hindlll gespalten und in den mit BamHI/Hindlll-gespaltenen und 
dephosphorylierten Vektor pCZPP inseriert, wodurch der 
Expressionsvektor pCZPPHBcRGC erhalten wird. Dieser 
Express ionsvektor kodiert fur HBcAg, das N- terminal die ZPP- 
Sequenz und im Bereich der Aminosauren 79-82 die RGD-Sequenz 
10 enthalt. 

Des weiteren werden etwa 0.8xl0 6 HepG2-Zellen mittels 
Liopfektion mit 4/ig eines fur HBV- Polymerase kodierenden 
Express ionsvektors und mit 2 /ig pCDN3 transf iziert . Es wird 
15 auf die vorstehende Beschreibung von Beispiel 1, (D) verwiesen. 

Es wird eine mit HepG2-HBV Pol bezeichnete Zellinie erhalten. 

Etwa 0.8xl0 6 Zellen der Zellinie HepG2-HBV Pol werden mit 3/xg 
von pCZPPHBc RGC und 3fig von pCVPHBVeGFP (vgl . Beispiel 1,6) 
20 mittels Lipofektion transf iziert . Es wird auf vorstehende 
Beschreibung von Beispiel 1,(E) verwiesen. Es werden 
erf indungsgemaSe Partikel in reiner Form erhalten. 

Beispiel 3: Nachweis der Expression einer in 
25 erfindungsgemafcen Partikeln vorliegender 

NUkleinsaure in Fibroblasten 

Etwa IxlO 9 erf indungsgemaSe Partikel von Beispiel 1 (E) bzw. 
Beispiel 2, werden in lOOjil 0,9 % Saline gelost und in die 
30 Schwanzvene von balb/c Mausen inj iziert. 48 h nach Injektion 
wird der Soleus- und der Tibialis anterior Muskel isoliert und 
in einem "tissue tag" langsam eingefroren. Aus den eingefrore- 
nen Praparaten werden Kryoschnitte angefertigt und diese unter 
einem Fluoreszenzmikroskop bei Blauanregung analysiert. 

35 

Es wird eine grune Fluoreszenz in den Fibroblasten erhalten, 
was die Expression des "green fluorescent protein" zeigt. 
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Beispiel 4: Herstellung und Reinigung eines 
erf indungsgemaSen Fusionsproteins 

5 Es wird das erf indungsgemaSe Fusionsprotein von Fig. 1 
hergestellt. Hierzu wird die DNA von Fig. 1 am 5 ' -Ende mit 
einem Bglll-Linker. und am 3 ' -Ende rait einem Bglll-Linker 
versehen und mit den entsprechenden Restriktionsenzymen nach- 
gespalten. Das erhaltene Bglll/Bglll-Fragment wird in den Bam- 

10 Hl-gespaltenen Expressionsvektor pQE8 inseriert, so daS das 
Expressionsplasmid pQE8/LHBs erhalten wird. Ein solches 
kodiert fur ein Fusionsprotein aus 6 Histidin-Resten (N- 
Terminuspartner) und dem erf indungsgemaSen Fusionsprotein von 
Fig. 1 (OTerminuspartner) . pQE-8/LHBs wird zur Transformation 

15 von E.coli SG 13009 (vgl. Gottesman, S. et al . , J. Bacteriol. 

148, (1981), 265-273) verwendet . Die Bakterien werden in einem 
LB-Medium mit 100/ig/ml Ampicillin und 25^g/ml Kanamycin 
kultiviert und 4 h mit 6Q(M Isopropyl-E-D-Thiogalactopyranosid 
(IPTG) induziert. Durch Zugabe von 6 M Guanidinhydrochlorid 

20 wird eine Lyse der Bakterien erreicht, anschlieSend wird mit 
dem Lysat eine Chromatographie (Ni-NTA-Resin) in Gegenwart von 
8 M Harnstoff entsprechend der Angaben des Herstellers (Qia- 
gen) des Chroma tographie-Materials durchgefiihrt . Das gebundene 
Fusionsprotein wird in einem Puffer mit pH 3,5 eluiert. Nach 

25 seiner Neutralisierung wird das Fusionsprotein einer 18 % SDS- 
Polyacrylamid-Gelelektrophorese unterworfen und mit Coomassie- 
Blau angefarbt (vgl. Thomas, J.O. und Romberg, R.D., 
J.Mol.Biol. 149 (1975), 709-733). 

30 Es zeigt sich, daS ein erf indungsgemaSes Fusionsprotein in 
hochr einer Form hergestellt werden kann. 

Beispiel 5: Herstellung und Nachweis eines 
erf indungsgexnafien Antikdrpers 

35 

Ein erf indungsgemaSes Fusionsprotein von Beispiel 4 wird einer 
18 %igen SDS-Polyacrylamid-Gelelektrophorese unterzogen. Nach 
Anfarbung des Gels mit 4 M Natriumacetat wird eine 38 kD Bande 
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aus dem Gel herausgeschnitten und in_ Phosphat gepufferter 
Kochsalzlosung inkubiert. Gel-Stucke werden sedimentiert , 
bevor die Proteinkonzentration des Uberstandes durch .eine SDS- 
Polyacrylamid-Gelelektrophorese, der eine Coomassie-Blau- 
Farbung folgt, bestimmt wird. Mit dem Gel-gereinigten 
Fusionspolypeptid werden Tiere wie folgt immunisiert: 

Immunisierungsprotokoll fur polyklonale Antikorper im 
Kaninchen 

Pro immunisierung werden 3 5 /ig Gel-gereinigtes Fusionsprotein 
in 0,7 ml PBS und 0,7 ml komplettem bzw. inkomplettem Freund's 
Adjuvans eingesetzt. 

Tag O: 1. Immunisierung (komplettes Freund's Adjuvans) 
Tag 14: 2. Immunisierung ( inkomplettes Freund's Adjuvans; 



Das Serum des Kaninchens wird im Immunoblot getestet. Hierzu 
wird ein erf indungsgemaSes Fusionsprotein von Beispiel 4 einer 
SDS-Polyacrylamid-Gelelektrophorese unterzogen und auf ein 
Nitrocellulosef ilter iibertragen (vgl . Khyse -Anders en, J., J. 
Biochem. Biophys . Meth. 10, (1984), 203-209). Die Western 
Blot-Analyse wurde wie in Bock, C.-T. et al . , Virus Genes 8, 
(1994), 215-229, beschrieben, durchgefuhrt . Hierzu wird das 
Nitrocellulosefilter eine Stunde bei 37°C mit einem ersten 
Antikorper inkubiert. Dieser Antikorper ist das Serum des 
Kaninchens (1:10000 in PBS). Nach mehreren Waschschritten mit 
PBS wird das Nitrocellulosefilter mit einem zweiten Antikorper 
inkubiert. Dieser Antikorper ist ein mit alkalischer Phospha- 
tase gekoppelter monoklonaler Ziege Ant i -Kaninchen- IgG- 
Antikorper (Dianova) (1:5000) in PBS. Nach 30-minutiger 
Inkubation bei 37°C folgen mehrere Waschschritte mit PBS und 
anschlieSend die alkalische Phosphatase-Nachweisreaktion mit 
Entwicklerlosung (36/iM 5 ' Bromo-4-chloro-3-indolylphosphat , 



Tag 28 
Tag 56 
Tag 80 



icFA) 

3. Immunisierung (icFA) 

4. Immunisierung (icFA) 



Ausbluten 
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400^M Nitroblau-tetrazolium, lOOmM Tris-HCl, pH 9.5, 100 mM 
NaCl, 5 mM MgCl 2 ) bei Raumtemperatur , bis Banden sichtbar 
werden . 

5 Es zeigt sich, dafi erf indungsgemaSe , polyklonale Antikorper 
hergestellt werden konnen. 

Immunisierungsprotokoll fttr polyklonale Antikorper im Huhn 

10 Pro Immunisierung werden 40/xg Gel-gereinigtes Fusionsprotein 
in 0,8 ml PBS und 0,8 ml komplettem bzw. inkomp let tern Freund's 
Adjuvans eingesetzt. 

Tag O. 1. Immunisierung (komplettes Freund's Adjuvans) 
15 Tag 28: 2. Immunisierung ( inkomplettes Freund's Adjuvans; 

icFA) 

Tag 50: 3. Immunisierung (icFA) 

Aus Eigelb werden Antikorper extrahiert und im Western Blot 
20 getestet. Es werden erf indungsgemaSe, polyklonale Antikorper 
nachgewiesen . 
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Immunisierungsprotokoll fur monoklonale Antikorper der Maus 

Pro Immunisierung werden 12/xg Gel-gereinigtes Fusionsprotein 
in 0,25 ml PBS und 0,25 ml komplettem bzw. inkomplettem 
5 Freund's Adjuvans eingesetzt; bei der 4. Immunisierung ist das 
Fusionsprotein in 0,5 ml (ohne Adjuvans) geldst. 



10 



Tag O. 
Tag 28: 

Tag 56 
Tag 84 
Tag 87 



1. Immunisierung (komplettes Freund's Adjuvans) 

2 . Immunisierung ( inkomplettes Freund ' s Adjuvans ; 

icFA) 

3. Immunisierung (icFA) 

4. Immunisierung (PBS) 
Fusion 



15 



Uberstande von Hybridomen werden im Western Blot getestet. 
Erf indungsgema&e, monoklonale Antikorper werden nachgewiesen. 
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Patentanspruche 

1. Partikel, umfassend: 

5 (a) eine Proteinhulle mit einem Fusionsprotein, das ein 

Virus-Protein, ein Zellpermeabilitat-vermittelndes 
Peptid und eine heterologe zellspezif ische Bindungs- 
stelle umfaSt, und 
(b) eine in der Proteinhulle vorliegende Nukleinsaure, 
10 die Sequenzen fur ein Virus-spezif isches 

Verpackungssignal und ein Struktur-Gen aufweist. 

2. Partikel nach Anspruch 1, wobei das Virus-Protein von 
einem Adenovirus, Ad en o - a s s o z i i e r t en Virus, 

15 Vacciniavirus, Baculovirus oder Hepadna-Virus stammt. 

3. Partikel nach Anspruch 2, wobei das Hepadna-Virus ein 
Hepatitis B-Virus ist. 

20 4. Partikel nach einem der Anspruche 1-3, wobei das Virus- 
Protein ein Oberf lachenprotein ist. 

5. Partikel nach Anspruch 4, wobei das Oberf lachenprotein 
ein LHBs ist. 



25 



Partikel nach einem der Anspruche 1-3, wobei das Virus- 
Protein ein Core-Protein ist. 



7. Partikel nach Anspruch 6, wobei das Core-Protein ein 
30 HBcAg ist. 

8. Partikel nach einem der Anspruche 1-7, wobei das 
Zellpermeabilitat-vermittelnde Peptid die folgende 
Aminosauresequenz aufweist: 

35 



Partikel nach einem der Anspruche 1-8, wobei die 
heterologe zellspezif ische Bindungsstelle RGD ist. 
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10. Partikel nach einera der Anspriiche 1-9, wobei das 
Fusionsprotein jenes von Fig. 1 oder 2 ist. 

11. Verfahren zur Herstellung des Partikels nach Anspruch 1, 
5 wobei das Fusionsprotein ein LHBs und eine heterologe 

zellspezif ische Bindungss telle enthalt, umfassend die 
folgenden Verf ahrensschritte : 

(a) Co-Transf ektion von Zellen, die fur ein Heptatitis 
10 B-Virus-Genom kodieren, wobei diese kein LHBs 

exprimieren, mit einem ersten Expressionsvektor , der 
fur ein Fusionsprotein kodiert, das ein LHBs und 
eine heterologe zellspezif ische Bindungsstelle 
umfaSt, und mit einem zweiten Expressionsvektor, der 
15 ein Virus-spezif isches 

Verpackungs signal und ein Struktur-Gen aufweist, und 

(b) Isolierung und Reinigung des Partikels. 

20 12. Verfahren zur Herstellung des Partikels nach Anspruch 1, 
wobei das Fusionsprotein ein H B c A g , ein 
Zellpermeabilitat-vermittelndes Peptid und eine 
heterologe zellspezif ische Bindungsstelle aufweist, 
umfassend die folgenden Verf ahrensschritte: 

25 

(a) Co-Transfektion von Zellen, die fur eine HBV- 
Polymerase kodieren, mit einem ersten 
Expressionsvektor, der fur ein Fusionsprotein 
kodiert, das ein HBcAg, ein Zellpermeabilitat- 
30 vermittelndes Peptid und eine heterologe 

zellspezif ische Bindungsstelle umfaSt, und mit einem 
zweiten Expressionsvektor, der ein Virus- 
spezifisches Verpackungssignal und ein Struktur-Gen 
aufweist, und 



35 



(b) Isolierung und Reinigung des Partikels 



13. Fusionsprotein, umfassend ein Virus-Protein, ein 
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Zellpermeabili tat-vermi t t elndes Peptid und eine 
heterologe zellspezif ische Bindungsstelle . 



14. Fusionsprotein nach Anspruch 13, umfassend die 
Aminosauresequenz von Fig. 1 Oder 2 oder eine hiervon 
durch ein oder mehrere Aminosauren unterschiedliche 
Aminosauresequenz . 

15. DNA, kodierend fur das Fusionsprotein nach Anspruch 13. 

16. DNA, kodierend fur das Fusionsprotein nach Anspruch 14, 
umfassend: 

(a) Die DNA von Fig. 1 oder 2 oder eine hiervon durch 
ein oder mehrere Basenpaare unterschiedliche DNA, 
oder 

(b) eine mit der DNA von (a) uber den degenerierten 
gentischen Code verwandte DNA. 

17. Expressionsvektor, kodierend fur die DNA nach Anspruch 
16. 

18. Verwendung des Partikels nach einem der Anspriiche 1-10 
zur Gentherapie von Zellen und Geweben. 
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atgggccgtggcgaaggagctggagcattcgggctgggtttcaccccaccgcacggaggccttttg 
gggtggagccctcaggctcagggcatactacaaactttgccagcaaatccgcctcctgcctccacc 
aatcgccagacaggaaggcagcctaccccgctgtctccacctttgagaaacactcatcctcaggcc 
atgcagtggaattccacaacctttcaccaaactctgcaagatcccagagtgagaggcctgtatttc 
cctgctggtggctccagttcaggagcagtaaaccctgttccgactactgcctctcccttatcgtca 
atcttctcgaggattggggaccctgcgctgaacatggagaacatcacatcaggattcctaggaccc 
cttctcgtgttacaggcggggtttttcttgttgacaagaatcctcacaataccgcagagtctagac 
tcgtggtggacttctctcaattttctagggggaactaccgtgtgtcttggccaaaattcgcagtcc 
ccaacctccaatcactcaccaacctcctgtcctccaacttgtcctggttatcgctggatgtgtctg 
cggcgttttatcatcttcctcttcatcctgctgctatgcctcatcttcttgttggttcttctggac 
tatcaaggtatgttgcccgtttgtcctctaattccaggatcctcaaccaccagcacgggaccatgc 
cgaacctgcatgactactgctcaaggaacctctatgtatccctcctgttgctgtaccaaaccttcg 
gacggaaattgcacctgtattcccatcccatcatcctgggctttcggaaaattcctatgggagtgg 
gcctcagcccgtttctcctggctcagtttactagtgccatttgttcagtggttcgtagggctttcc 
cccactgtttggctttcagttatatggatgatgtggtattgggggccaagtctgtacagcatcttg 
agtccctttttaccgctgttaccaattttcttttgtctttgggtatacatttaaacc 



MGRGDGAGAFGLGFTPPHGGLLGWSPQAQGILETLPANPPPASTNRQSGRQPTPLSPPLRNTHPQA 
MQWNSTTFHQTLQDPRVRGLYFPAGGSSSGTVNPVPTTVSPISSIFSRIGDPALNMENITSGFLGP 
LLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQSPTSNHSPTSCPPTCPGYRWMCL 
RRFIIFLFILLLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTCTTPAQGTSMYPSCCCTKPS 
DGNCTCIPIPSSWAFGKFLWEWASARFSWLSLLVPFVQWFVGLSPTWLSVIWMMWYWGPSLYSIL 

SPFLPLLPIFFCLWVYI 



FIG.l 



ERSATZBLATT (KEGEL 26) 
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atgcccatatcgtcaatcttctcgaggattggggaccctggatccactactgttcaagcctccaag 
ctgtgccttgggtggctttggggcatggacatcgacccttataaagaatttggagctactgtggag 
ttactctcgtttttgccttctgacttctttccttcagtacgagatcttctagataccgcctcagct 
ctgtatcgggaagccttagagtctcctgagcattgttcacctcaccatactgcactcaggcaagca 
attctttgctggggggaactaatgactctagctacctgggtgggtgttaatttggaagatccagaa 
ttccgaggcgacgcgtctagagacctagtagtcagttatgtcaacactaatatgggcctaaagttc 
aggcaactcttgtggtttcacatttcttgtctcacttttggaagagaaaccgttatagagtatttg 
gtgtctttcggagtgtggattcgcactcctccagcttatagaccaccaaatgcccctatcctatca 
acacttccggaaactactgttgttagacgacgaggcaggtcccctagaagaagaactccctcgcct 
cgcagacgaaggtctcaatcgccgcgtcgcagaagatctcaatctcgggaacctcaatgttagtat 
tec 



MPLSSIFSRIGDP TVQASKLCLGWLWGMDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTASALYR 
EALESPEHCSPHHTALRQAILCWGELMTLATWVGVNLEDPEFRGDASRDLVVSYVNTNMGLKFRQL 
LWFHISCLTFGRETVIEYLVSFGVWIRTPPAYRPPNAPILSTLPETTWRRRGRSPRRRTPSPRRR 
RSQSPRRRRSQSREPQC 



Fig. 2 



ERSA TZBLA TT (KEGEL 26) 
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SEQUENZ PROTOKOLL . 



(1) ALLGEMEINE ANGABEN: 
(i) ANMELDER: 

(A) NAME: Dr. Eberhardt Hildt, Klinikum Rechts d. Isar 

(B) STRASSE: Ismaninger Str. 

(C) ORT: Muenchen 

(E) LAND: Deutschland 

(F) POSTLEITZAHL : 81675 

(ii) BEZEICHNUNG DER ERFINDUNG: Partikel zur Gentherapie 
(iii) ANZAHL DER SEQUENZEN: 19 

(iv) COMPUTER -LESBARE FAS SUNG : 

(A) DATENTRAGER: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) BETRIEBSSYSTEM: PC - DOS /MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, Version" #1.30 (EPA) 

(v) DATEN DER JETZIGEN ANMELDUNG : 

ANMELDENUMMER: DE 199 04 800.2 



(2) ANGABEN ZU SEQ ID NO: 1: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 347 Aminos aur en 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS : Peptid 



(xi) SEQUENZ BESCHRE I BUNG : SEQ ID NO: 1: 

Met Gly Arg Gly Asp Gly Ala Gly Ala Phe Gly Leu Gly Phe Thr Pro 
15 10 15 

Pro His Gly Gly Leu Leu Gly Trp Ser Pro Gin Ala Gin Gly lie Leu 
20 25 30 

Glu Thr Leu Pro Ala Asn Pro Pro Pro Ala Ser Thr Asn Arg Gin Ser 
35 40 45 

Gly Arg Gin Pro Thr Pro Leu Ser Pro Pro Leu Arg Asn Thr His Pro 
50 55 60 

Gin Ala Met Gin Trp Asn Ser Thr Thr Phe His Gin Thr Leu Gin Asp 
65 70 75 80 

Pro Arg Val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly 
85 90 95 

Thr Val Asn Pro Val Pro Thr Thr Val Ser Pro lie Ser Ser He Phe 
100 105 HO 

Ser Arg He Gly Asp Pro Ala Leu Asn Met Glu Asn He Thr Ser Gly 
115 120 125 
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Phe Leu Gly Pro Leu Leu Val Leu Gin Ala Gly Phe Phe Leu Leu Thr 
130 135 140 

Arg He Leu Thr He Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu 
1*5 150 155 * ' " 160 

Asn Phe Leu Gly Gly Thr Thr Val Cys Leu Gly Gin Asn Ser Gin Ser 
165 170 175 

Pro Thr Ser Asn His Ser Pro Thr Ser Cys Pro Pro Thr Cys Pro Gly 
180 185 190 

Tyr Arg Trp Met Cys Leu Arg Arg Phe He He Phe Leu Phe He Leu 
195 200 205 

Leu Leu Cys Leu He Phe Leu Leu Val Leu Leu Asp Tyr Gin Gly Met 
210 215 220 

Leu Pro Val Cys Pro Leu He Pro Gly Ser Ser Thr Thr Ser Thr Gly 
225 230 235 240 

Pro Cys Arg Thr Cys Thr Thr Pro Ala Gin Gly Thr Ser Met Tyr Pro 
245 250 255 

Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys Thr Cys He Pro 
260 265 270 

lie Pro Ser Ser Trp Ala Phe Gly Lys Phe Leu Trp Glu Trp Ala Ser 
275 280 285 

Ala Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe Val Gin Trp Phe 
290 295 300 

Val Gly Leu Ser Pro Thr Val Trp Leu Ser Val He Trp Met Met Trp 
305 310 315 320 

Tyr Trp Gly Pro Ser Leu Tyr Ser He Leu Ser Pro Phe Leu Pro Leu 
325 330 335 



Leu Pro He Phe Phe Cys Leu Trp Val Tyr He 
340 345 



(2) ANGABEN ZU SEQ ID NO : 2: 

( i ) SEQUENZKENNZEICHEN : 

(A) LANGE: 215 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Peptid 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 2: 

Met Pro Leu Ser Ser He Phe Ser Arg He Gly Asp Pro Thr Val Gin 
15 10 15 

Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He Asp Pro 
20 25 30 

Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu Pro Ser 
35 40 45 
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Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser Ala Leu 
50 55 60 

Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His His Thr 
65 70 75 80 

Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu Leu Met Thr Leu Ala 
85 90 95 

Thr Trp Val Gly Val Asn Leu Glu Asp Pro Glu Phe Arg Gly Asp Ala 
100 105 110 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
115 120 125 

Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
130 135 140 

Glu Thr Val He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
145 150 155 160 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
165 170 175 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
180 185 190 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
195 200 205 

Gin Ser Arg Glu Pro Gin Cys 
210 215 



(2) ANGABEN ZU SEQ ID NO: 3: 

( i ) SEQUENZKENNZEICHEN : 

(A) LANGE: 663 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS : CDNA 



(xi) SEQUENZBESCHREIBUNG : SEQ ID NO: 3: 

ATGCCCATAT CGTCAATCTT CTCGAGGATT GGGGACCCTG GATCCACTAC TGTTCAAGCC 60 

TCCAAGCTGT GCCTTGGGTG GCTTTGGGGC ATGGACATCG ACCCTTATAA AGAATTTGGA 120 

GCTACTGTGG AGTTACTCTC GTTTTTGCCT TCTGACTTCT TTCCTTCAGT ACGAGATCTT 180 

CTAGATACCG CCTCAGCTCT GTATCGGGAA GCCTTAGAGT CTCCTGAGCA TTGTTCACCT 240 

CACCATACTG CACTCAGGCA AGCAATTCTT TGCTGGGGGG AACTAATGAC TCTAGCTACC 300 

TGGGTGGGTG TTAATTTGGA AGATCCAGAA TTCCGAGGCG ACGCGTCTAG AGACCTAGTA 360 

GTCAGTTATG TCAACACTAA TATGGGCCTA AAGTTCAGGC AACTCTTGTG GTTTCACATT 420 

TCTTGTCTCA CTTTTGGAAG AGAAACCGTT ATAGAGTATT TGGTGTCTTT CGGAGTGTGG 480 
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ATTCGCACTC CTCCAGCTTA TAGACCACCA AATGCCCCTA TCCTATCAAC ACTTCCGGAA 540 

ACTACTGTTG TTAGACGACG AGGCAGGTCC CCTAGAAGAA GAACTCCCTC GCCTCGCAGA 600 

CGAAGGTCTC AATCGCCGCG TCGCAGAAGA TCTCAATCTC GGGAACCTCA ATGTTAGTAT 660 

TCC 663 



(2) ANGABEN ZU SEQ ID NO: 4: 

(i) SEQUENZKENNZEICHEN: 

(A) L&NGE: 1047 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKtiLS: cDNA 



(xi) SEQUENZBESCHREIBUNG : SEQ ID NO: 4: 



ATGGGCCGTG 


GCGAAGGAGC 


TGGAGCATTC 


GGGCTGGGTT 


TCACCCCACC 


GCACGGAGGC 


60 


CTTTTGGGGT 


GGAGCCCTCA 


GGCTCAGGGC 


ATACTACAAA 


CTTTGCCAGC 


AAATCCGCCT 


120 


CCTGCCTCCA 


CCAATCGCCA 


GACAGGAAGG 


CAGCCTACCC 


CGCTGTCTCC 


ACCTTTGAGA 


180 


AACACTCATC 


CTCAGGCCAT 


GCAGTGGAAT 


TCCACAACCT 


TTCACCAAAC 


TCTGCAAGAT 


240 


CCCAGAGTGA 


GAGGCCTGTA 


TTTCCCTGCT 


GGTGGCTCCA 


GTTCAGGAGC 


AGTAAACCCT 


300 


GTTCCGACTA 


CTGCCTCTCC 


CTTATCGTCA 


ATCTTCTCGA 


GGATTGGGGA 


CCCTGCGCTG 


360 


AACATGGAGA 


ACATCACATC 


AGGATTCCTA 


GGACCCCTTC 


TCGTGTTACA 


GGCGGGGTTT 


420 


TTCTTGTTGA 


CAAGAATCCT 


CACAATACCG 


CAGAGTCTAG 


ACTCGTGGTG 


GACTTCTCTC 


480 


AATTTTCTAG 


GGGGAACTAC 


CGTGTGTCTT 


GGCCAAAATT 


CGCAGTCCCC 


AACCTCCAAT 


540 


CACTCACCAA 


CCTCCTGTCC 


TCCAACTTGT 


CCTGGTTATC 


GCTGGATGTG 


TCTGCGGCGT 


600 


TTTATCATCT 


TCCTCTTCAT 


CCTGCTGCTA 


TGCCTCATCT 


TCTTGTTGGT 


TCTTCTGGAC 


660 


TATCAAGGTA 


TGTTGCCCGT 


TTGTCCTCTA 


ATTCCAGGAT 


CCTCAACCAC 


CAGCACGGGA 


720 


CCATGCCGAA 


CCTGCATGAC 


TACTGCTCAA 


GGAACCTCTA 


TGTATCCCTC 


CTGTTGCTGT 


780 


ACCAAACCTT 


CGGACGGAAA 


TTGCACCTGT 


ATTCCCATCC 


CATCATCCTG 


GGCTTTCGGA 


840 


AAATTCCTAT 


GGGAGTGGGC 


CTCAGCCCGT 


TTCTCCTGGC 


TCAGTTTACT 


AGTGCCATTT 


900 


GTTCAGTGGT 


TCGTAGGGCT 


TTCCCCCACT 


GTTTGGCTTT 


CAGTTATATG 


GATGATGTGG 


960 


TATTGGGGGC 


CAAGTCTGTA 


CAGCATCTTG 


AGTCCCTTTT 


TACCGCTGTT 


ACCAATTTTC 


1020 


TTTTGTCTTT 


GGGTATACAT 


TTAAACC 








1047 
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(2) ANGABEN ZU SEQ ID NO: 5: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 3 5 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM : Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKttLS : Sonstige Nucleinsaure 
(A) BESCHREIBUNG : /desc = "Primer" 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 5: 
CCATATTCTT GGGAACAAGA TATCCAGCAC GGGGC 35 



(2) ANGABEN ZU SEQ ID NO: 6: 

( i ) SEQUENZKENNZEICHEN : 

(A) LANGE: 33 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS : Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 6: 
GGATTGCTGG TGGAAGATAT CTGCCCCGTG CTG 



(2) ANGABEN ZU SEQ ID NO: 7: 

{ i ) SEQUENZKENNZEICHEN : 

(A) LANGE: 33 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKttLS : Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 7: 
CAGCACGGGG CAGATATCTT CCACCAGCAA TCC 



(2) ANGABEN ZU SEQ ID NO: 8: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 3 8 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM : Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKtJLS: Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 
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(xi) SEQUENZBESCHREIBUNG : SEQ ID NO: 8: 
GCCCCGTGCT GGATATCATC TTGTTCCCAA GAATATGG 



(2) ANGABEN ZU SEQ ID NO: 9: 

{ i ) SEQUENZKENNZ E I CHEN : 

(A) LANGE: 36 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM : Einz el Strang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKttLS: Sonstige Nucleinsaure 
(A) BESCHREIBUNG : /desc = " Primer • 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 9: 
AAAAGATCTG GCCGTGGCGA AGGAGCTGGA GCATTC 



(2) ANGABEN ZU SEQ ID NO: 10: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 30 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einz el Strang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKuXS : Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 10: 
AAAAGATCTG GTTTAAATGT AT AC C C AAAG 



(2) ANGABEN ZU SEQ ID NO: 11: 

< i ) SEQUENZKENNZEICHEN : 

(A) LANGE: 33 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKOLS : Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 11: 
CCCGATATCA TGTCATCTCT TGTTCATGTC CTA 



(2) ANGABEN ZU SEQ ID NO: 12: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 30 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 
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(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Sonstige Nucleinsaure 
(A) BESCHREIBUNG : /desc = "Primer" 



(Xi) SEQUENZBESCHREIBUNG : SEQ ID NO: 12: 
GGGGATATCG GTCGATGTCC ATGCCCCAAA 



(2) ANGABEN ZU SEQ ID NO: 13: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 36 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 13: 
GGGGGATCCC GATGTACGGG CCAGATATAC GCGTTG 



(2) ANGABEN ZU SEQ ID NO: 14: 

( i ) SEQUENZKENNZEICHEN : 

(A) LANGE: 27 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKiJLS: Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 14: 
GGGGGATCCG CGGCCGCTTT ACTTGTA 



(2) ANGABEN ZU SEQ ID NO: 15: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 57 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 15: 
NNNAGATCTA TGCCCATATC GTCAATCTTC TCGAGGATTG GGGACCCTGG ATCCNNN 
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(2) ANGABEN ZU SEQ ID NO: 16: 

(i) SEQUENZKENNZEICHEN : 

(A) LANGE: 30 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKuXS : Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 



(xi) SEQUENZBESCHREIBUNG : SEQ ID NO: 16: 
NNNGGATCCA CTGTTCAAGC CTCCAAGCTG 



(2) ANGABEN ZU SEQ ID NO: 17: 

( i ) SEQUENZKENNZEICHEN : 

(A) LANGE: 36 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM : Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKtiLS : Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 17: 
NNNGAATTCT GGATCTTCCA AATTAACACC CACCCA 



(2) ANGABEN ZU SEQ ID NO: 18: 

( i ) SEQUENZKENNZEI CHEN : 

(A) LANGE: 39 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEK0LS : Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 18: 
NNNGAATTCC GAGGCGACGC GTCTAGAGAC CTAGTAGTC 



(2) ANGABEN ZU SEQ ID NO: 19: 

( i ) SEQUENZKENNZEICHEN : 

(A) LANGE: 30 Basenpaare 

(B) ART: Nucleotid 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKOLS: Sonstige Nucleinsaure 
(A) BESCHREIBUNG: /desc = "Primer" 

(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 19: 
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NNNAAGCTTT CCCCACCTTA TGAGTCCAAG 



30 
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